[opencms-dev] problems searching word doc in OpenCms 6.0b1

Dan Tenenbaum dandante at dandante.com
Thu Mar 17 00:56:37 CET 2005


Hello,

Sorry if this has been answered already, but I couldn't find anything in the
archives.

I realize this may be a Lucene or POI issue, but since I am using OpenCms I
thought this was the best place to start.

To test the functionality of indexing/searching Word documents and PDF's, I
tried indexing a few random files in each format and then searching for text
known to be in the files. With the following file:

http://systemsbiology.org/extra/AffirmativeActionForm.doc

The word "Affirmative" is definitely in the file as you can see by viewing
it. However, once I index and search through the document, searching for
"Affirmative" yields no results, though I can search for other words very
close to that word and get the correct results. PDF files appear to be fully
indexed/searchable.

If this sounds like a bug, please let me know and I will file it as such. If
there is a way to tune my indexing so that this word is retrieved, let me
know. Otherwise I will probably have to use another solution, as arbitrary,
partial indexing is not really an option.

Thanks





More information about the opencms-dev mailing list