[opencms-dev] PDF and Word files indexation with Lucene module 1.5

Ernesto De Santis ernesto.desantis at colaborativa.net
Mon Mar 1 17:48:02 CET 2004


Hi Federico

                <docFactory enabled="true" type="binary">
                    <fileType name="pdftext">
                        <extension>.pdf</extension>

<class>net.grcomputing.opencms.search.lucene.PDFDocument</class>
                    </fileType>
                    <fileType name="wordtext">
                        <extension>.doc</extension>

<class>net.grcomputing.opencms.search.lucene.WordDocument</class>
                    </fileType>
                </docFactory>

Good luck
Ernesto.


----- Original Message ----- 
From: "Federico Grilli" <federicog at gruppoinit.it>
To: <opencms-dev at opencms.org>
Sent: Monday, March 01, 2004 1:31 PM
Subject: [opencms-dev] PDF and Word files indexation with Lucene module 1.5


Hello everybody,

I have successfully installed Lucene module 1.5. Everything works fine with
txt and html files, while I cannot figure out how to index pdf and word
documents. Could somebody please tell me which entries I need to add to the
registry.xml to get it work?

Thanks in advance,

Federico Grilli




More information about the opencms-dev mailing list