Processing Ajax...

Title
Close Dialog

Message

Confirm
Close Dialog

Confirm
Close Dialog

Query - false positives

User Image
alisam
5 discussion posts
I searched for the word 'yen' (as in Japanese Yen) and Fileseek 1.7.1 found word documents which I was not expecting.

Fileseek shows a 'Line Number' e.g. 1063, but the 'Line' shows rubbish.
At the bottom of the screen in its own window it shows a lot of rubbish but the word 'yen' is clearly seen
For example:

}{yen��2�����IX�����hZ������c

(It hasn't pasted in exactly but you can see why it has picked up the word 'yen'.

Now that I understand what it is doing I can work around it. But I would like to know what it is finding.
Mar 31, 2009  • #1
Jon Tackabury (BFS)'s profile on WallpaperFusion.com
Word documents are binary, and are not easily searchable. A "line" in a binary file could be thousands of characters long, and isn't the same as a line in a text file. Also, if any compression is used in the file (like the new Word 2007 .docx files) then you will almost never find any matches for your words. In order to correctly search these types of files, a search program would have to be able to load and parse the file. Unfortunately, FileSeek can't do this, as it would mean coding in dozens of file parsers and just isn't practical.
Apr 6, 2009  • #2
Was this helpful?  Login to Vote(-)  Login to Vote(-)