[opencms-dev] Lucene and Binary Documents

Ben Rometsch ben at solidstategroup.com
Wed Oct 15 03:27:02 CEST 2003


Hi,

I have the Lucene module working fine, indexing HTML documents on my site. I
know you can plug in extra components to have Lucene index PDF and Microsoft
Word documents; has anyone managed to do this within OpenCms? Are there any
steps that differ from an out-of-the-box Lucene installation?

As an interim measure, how easy would it be to have Lucene index just the
filenames of any Word or PDF documents within a certain area of the VFS? Can
anyone provide any information on how to go about this?

Thanks,
Ben



