Re: Hi... Need your advice
< Next Message | Back to archived message list | Previous Message >
Note: This message is from the outliners.com archive kindly provided by Dave Winer.
Outliners.com Message ID: 3716
Posted by graham.smith
2005-08-10 10:12:36
In spite of the lack of indexing Zoot is very fast, and I have just confirmed that it does indeed search the entire linked file, and not just the 32K that is imported into Zoot.
I still think that Zoot has to come close to being ideal for this. I know it isn’t 100’s of megabytes, but I just searchd 48Mb of PDFs (via the matching txt files) in exactly the same time as it took me to type the 12 letter keyword that I had added at the end of one of the files to check that Zoot was searching the whole file.
Zoot searches as you type, narrowing the search as you add letters. There is also and advanced search. This is on a sluggish 1.4Khz CPU box.
Even Copernic won’t search image based PDFs so unless you are sure that all the PDFs are text files this will be a problem.
The reason that I have the problem is that one of the online bibliographic libraries that I download scientific papers from, uses PDF images files and are un-searchable until I OCR them. Originally I OCRd them and created new PDFs, but the txt and Zoot solution is much better.
I have never used KnowledgeWorkshop, but I think people have been less than impressed with it.
Graham