Processing Ajax...

Title
Close Dialog

Message

Confirm
Close Dialog

Confirm
Close Dialog

PDF Search

User Image
drapkin11
3 discussion posts
I am trying to search inside a pdf file. FileSeek shows that the query term is found, however, does not list associated line numbers. Attached is the view of the output. While my file contains multiple instances of the word 'the', FileSeek returns only a single row.

In trying to resolve this problem, I considered that maybe the particular file handler FileSeek is using to search within pdf files is the problem. My preferred pdf reader is NitroPDF, and my Windows 7 associates pdf files with that app instead of Adobe Reader. However, when I look at the file handles, I notice that FileSeek uses Adobe's file handler to search within pdfs. It appears as though there is no way to make sure FileSeek associates a different application with pdf files.

So, this approach didn't resolve the problem either. Am I expecting FileSeek to function in a way it wasn't intended to, expecting FileSeek to return all the instances of found query items instead of a single row when multiple instances of a query item was found?

I hope that all made sense! :-)
• Attachment: pdf_query_output.jpg [123,086 bytes]
pdf_query_output.jpg
Jan 10, 2011  • #1
Jon Tackabury (BFS)'s profile on WallpaperFusion.com
Unfortunately, when searching files using the IFilters (the file handlers) they don't return the correct line numbers, so none are shown. As a result, FileSeek only shows 1 match if it is found, otherwise you would have lots of matches that look identical. Sorry for the limitation, I hope this explanation helps. :)
Jan 10, 2011  • #2
User Image
drapkin11
3 discussion posts
Jon, if I understand you correctly, IFilters can let the calling program know if they found the sought text or not, but not *where* the IFilter found it. So, the problem lies with an IFilter, and not its implementation? That still leaves me wondering what the most robust solution to searching within PDFs is.
Jan 12, 2011  • #3
Jon Tackabury (BFS)'s profile on WallpaperFusion.com
I would suggest using FileSeek to find the PDFs that contain the text you are looking for, then open them in Adobe Reader and search again to find the exact location in the PDF. The IFilters leave a bit to be desired, but at least they provide a basic way to search binary files like PDFs. :)
Jan 15, 2011  • #4
User Image
drapkin11
3 discussion posts
I agree - thanks Jon!
Jan 17, 2011  • #5
Jon Tackabury (BFS)'s profile on WallpaperFusion.com
No problem, enjoy! :)
Jan 17, 2011  • #6
Was this helpful?  Login to Vote(-)  Login to Vote(-)