Conversion from Pdf to txt file
Started by reverendmartian
on 10/15/2012
reverendmartian
10/15/2012 1:44 pm
I had downgrade UR, MyInfo and MyBase, all for which I own licenses, because I could not view pdf files natively from within the program; whereas, by contrast, I could view pdf files as pdf files from within NVivo. I am uncertain why I resisted converting pdfs to txt files and then importing them into UR et als as txt files. But, for reasons unimportant to this post, I have been consumed for the past few days in trying to find software that reliably converted pdf to txt files and I hit upon that has been a mainstay on my hard-drive for at least a dozen years: OmniPage Pro---now at version 18. Whereas some reasonably expensives file conversion/data extraction software erroneously finds windspace in the pdf file and breaks whole words into bits of letters, OmniPage Pro has a dictionary and looks for whole words and perfectly converts pdfs into txt files. Every 18 months or so, Nuance sends me an upgrade offer for about 100 dollars and I always upgrade. I had used the software to scan paper pleadings and discovery requests and responses and then convert them into Word documents to write into them. I had never thought about using OmniPage Pro to convert court decisions and journal articles---I share a "platinum" account on the APA database with a forensic psychologist. But I am certainly going to do it now!!!! Then I will import those converted files into UR and then go at it.
Having said that, I have to quickly point out that you just cannot code in UR etc as you can in NVivo. For sure you can select text in the Document Pane (or whatever it is called) in UR, and then connect it to a node in the hierarchical tree by hyperlink, but you cannot do multi-field queries in UR etc as you can do in NVivo. Nor do I expect that you should be able to have that functionality in software that costs on BDJ less than fifty bucks in comparison to what NVivo costs.
Still and all, the usefulness of UR et als has increased manifold now that I can import perfect formatted txt files from OmniPage Pro 10.
Cheers,
Mitch
Having said that, I have to quickly point out that you just cannot code in UR etc as you can in NVivo. For sure you can select text in the Document Pane (or whatever it is called) in UR, and then connect it to a node in the hierarchical tree by hyperlink, but you cannot do multi-field queries in UR etc as you can do in NVivo. Nor do I expect that you should be able to have that functionality in software that costs on BDJ less than fifty bucks in comparison to what NVivo costs.
Still and all, the usefulness of UR et als has increased manifold now that I can import perfect formatted txt files from OmniPage Pro 10.
Cheers,
Mitch
Jon Polish
10/15/2012 1:50 pm
You have the pro version of UR? I'm surprised that you don't see text in the pane. All pdfs (not scanned which are just pictures unless you perform some OCR) are displayed as text and are searchable.
Jon
Jon
Leib Moscovitz
10/15/2012 2:08 pm
You can display PDFs internally in UR (if I'm not mistaken, even with the Standard version) by clicking Options, Browser, and then adding .PDF under File Extensions to Display in Internal Browser view; indeed, I have been doing this for years.
quant
10/15/2012 2:14 pm
reverendmartian wrote:
I had downgrade UR, MyInfo and MyBase, all for which I own licenses, because I could not
view pdf files natively from within the program; whereas, by contrast, I could view
pdf files as pdf files from within NVivo.
you can view pdfs in UR as pdf: set up to view pdf in your explorer
reverendmartian wrote:
to quickly point out that you just cannot code in UR etc as you can in NVivo. For sure you
can select text in the Document Pane (or whatever it is called) in UR, and then connect
it to a node in the hierarchical tree by hyperlink, but you cannot do multi-field
queries in UR etc as you can do in NVivo.
you can do multiline queries in UR: click on Advanced in search item
reverendmartian
10/15/2012 6:58 pm
Jon Polish wrote:
You have the pro version of UR? I'm surprised that you don't see text in the pane. All
pdfs (not scanned which are just pictures unless you perform some OCR) are displayed
as text and are searchable.
Jon
Yes, I have seen the text displayed and it is a mess.
reverendmartian
10/15/2012 6:58 pm
LM7 wrote:
You can display PDFs internally in UR (if I'm not mistaken, even with the Standard
version) by clicking Options, Browser, and then adding .PDF under File Extensions to
Display in Internal Browser view; indeed, I have been doing this for years
Thanks I will try that..
reverendmartian
10/15/2012 6:59 pm
quant wrote:
reverendmartian wrote:
>I had downgrade UR, MyInfo and MyBase, all for which I own
licenses, because I could not
>view pdf files natively from within the program;
whereas, by contrast, I could view
>pdf files as pdf files from within NVivo.
you
can view pdfs in UR as pdf: set up to view pdf in your explorer
reverendmartian
wrote:
>to quickly point out that you just cannot code in UR etc as you can in NVivo. For
sure you
>can select text in the Document Pane (or whatever it is called) in UR, and
then connect
>it to a node in the hierarchical tree by hyperlink, but you cannot do
multi-field
>queries in UR etc as you can do in NVivo.
you can do multiline queries
in UR: click on Advanced in search item
Thanks I will try that as well
reverendmartian
10/15/2012 7:05 pm
LM7 wrote:
You can display PDFs internally in UR (if I'm not mistaken, even with the Standard
version) by clicking Options, Browser, and then adding .PDF under File Extensions to
Display in Internal Browser view; indeed, I have been doing this for years.
OMG. I did it and it works like an absolute charm. Thank you so much.
reverendmartian
10/15/2012 7:14 pm
I jumped the gun for a change; in UR you can only insert hyperlinks from RTF docs to items in the tree. No hyperlinks are available from pdf or txt files. Hence I will use OmniPage to create rtf files which I will then import into UR. There is no other coding work around that I can figure out other than by creating hyperlinks between selected text in the RTF docs and items.
Thanks everyone all the same.
Cheers,
Mitch
Thanks everyone all the same.
Cheers,
Mitch
quant
10/15/2012 11:21 pm
reverendmartian wrote:
there are in pdf if you can add link in pdf - for that you need pdf editor, UR links have special ur protocol, for example
ur:///?item=6835,6833,5710,1000&pos=554
or you can still have pdf and put links into "notes" pane.
try to explain what you are trying to achieve, that will probably get you more relevant answers, alternativelly ask directly at kinooks forum ...
I jumped the gun for a change; in UR you can only insert hyperlinks from RTF docs to items
in the tree. No hyperlinks are available from pdf or txt files.
there are in pdf if you can add link in pdf - for that you need pdf editor, UR links have special ur protocol, for example
ur:///?item=6835,6833,5710,1000&pos=554
or you can still have pdf and put links into "notes" pane.
try to explain what you are trying to achieve, that will probably get you more relevant answers, alternativelly ask directly at kinooks forum ...
reverendmartian
10/16/2012 2:46 am
quant wrote:
reverendmartian wrote:
>I jumped the gun for a change; in UR you can only insert
hyperlinks from RTF docs to items
>in the tree. No hyperlinks are available from pdf
or txt files.
there are in pdf if you can add link in pdf - for that you need pdf editor,
UR links have special ur protocol, for example
ur:///?item=6835,6833,5710,1000&pos=554
or you can still have pdf and put links
into "notes" pane.
try to explain what you are trying to achieve, that will probably
get you more relevant answers, alternativelly ask directly at kinooks forum ...
I differentiate the Data Explorer Pane (left-tree with data items) from the Items Detail Pane, left where the pdf is now natively shown. Assume I highlight text on the pdf in the Items Detail Pane, assume that I want to link it to an item in tree shown in the Data Explorer. I cannot link the selected text to an item in the Data Explorer Pane. Now assume that I have an rtf displayed in the Items Detail Pane. Assume I highlight some text on the document, I believe that if it is an RTF I can hyperlink it to an item in the Data Explorer Pane.
quant
10/16/2012 8:25 pm
reverendmartian wrote:
quant wrote:
>reverendmartian wrote:
>>I jumped the gun for a change; in UR you
can only insert
>hyperlinks from RTF docs to items
>>in the tree. No hyperlinks are
available from pdf
>or txt files.
>
>there are in pdf if you can add link in pdf - for
that you need pdf editor,
>UR links have special ur protocol, for example
>
>ur:///?item=6835,6833,5710,1000&pos=554
>or you can still have pdf and put
links
>into "notes" pane.
>
>try to explain what you are trying to achieve, that
will probably
>get you more relevant answers, alternativelly ask directly at
kinooks forum ...
>
>
I differentiate the Data Explorer Pane (left-tree with data
items) from the Items Detail Pane, left where the pdf is now natively shown. Assume I
highlight text on the pdf in the Items Detail Pane, assume that I want to link it to an
item in tree shown in the Data Explorer. I cannot link the selected text to an item in the
Data Explorer Pane. Now assume that I have an rtf displayed in the Items Detail Pane.
Assume I highlight some text on the document, I believe that if it is an RTF I can
hyperlink it to an item in the Data Explorer Pane.
you can do it with pdf as well, that's exactly what UR protocol I mentioned previously is for, but to be able to to that, you need to be able to have pdf editor that can create hyperlinks, eg pdf xchange viewer.
