Hi,
I intalled Lucene search module 1.5 succefully. I added the following tags to the registry.xml but still .pdf & .doc files won't get indexed . They are type binary. Is there somthing wrong with the entries. .txt files are indexed and can be searched.
<luceneSearch>
            <mergeFactor>100000</mergeFactor>
            <permCheck>true</permCheck>
            <indexDir>/lucene/TBOKCMS/</indexDir>
            <analyzer>org.apache.lucene.analysis.standard.StandardAnalyzer</analyzer>
            <subsearch>true</subsearch>
            <project>online</project>
            <docFactories>
                <pageDocFactory enabled="true" type="page">
                    <class>net.grcomputing.opencms.search.lucene.PageDocument</class>
                </pageDocFactory>
                <plainDocFactory enabled="true" type="plain">
                    <fileType name="plaintext">
                        <extension>.txt</extension>
                        <class>net.grcomputing.opencms.search.lucene.PlainDocument</class>
                    </fileType>
                    <fileType name="taggedtext">
                        <extension>.html</extension>
                        <extension>.htm</extension>
                        <extension>.xml</extension>
                        <!-- This will strip tags before processing -->
                        <class>net.grcomputing.opencms.search.lucene.TaggedPlainDocument</class>
                    </fileType>
                </plainDocFactory>
    <jspDocFactory enabled="false" type="jsp">
     <class>net.grcomputing.opencms.search.lucene.JspDocument</class>
    </jspDocFactory>
    <docFactory enabled="true" type="news">
                    <class>net.grcomputing.opencms.search.lucene.NewsDocument</class>
                </docFactory>
                <docFactory enabled="true" type="forum">
                    <class>de.wfnetz.opencms.modules.forum.ContributionDocument</class>
                </docFactory>
    <docFactory enabled="false" type="XML Template"/>
    
    <docFactory enabled="true" type="binary">
                    <fileType name="pdftext">
                        <extension>.pdf</extension>
                        <class>net.grcomputing.opencms.search.lucene.PDFDocument</class>
                    </fileType>
                    <fileType name="wordtext">
                        <extension>.doc</extension>
                        <class>net.grcomputing.opencms.search.lucene.WordDocument</class>
                    </fileType>
                </docFactory>    
   </docFactories>
            <directories>
                <directory location="/TBOKCMS/Documents/">
                    <section>TBOK CMS</section>
                    <subsearch>true</subsearch>
                </directory>
            </directories>
            <contentDefinitions>
                <contentDefinition type="news">
                    <class>com.opencms.modules.homepage.news.NewsContentDefinition</class>
                    <initClass>net.grcomputing.opencms.search.lucene.NewsInitialization</initClass>
                    <listMethod name="getNewsList">
                        <param type="java.lang.Integer">1</param>
                        <param type="java.lang.String">-1</param>
                    </listMethod>
                    <page uri="/news.html?__element=entry">
                        <param method="getIntId" name="newsid"/>
                    </page>
                </contentDefinition>
                <contentDefinition type="forum">
                    <class>de.wfnetz.opencms.modules.forum.ContributionContentDefinition</class>
                    <listMethod name="getSortedList">
                        <param type="java.lang.String"/>
                    </listMethod>
                    <page uri="/forum.html?forumtemplate=viewcontributionentry">
                        <param method="getId" name="conid"/>
                    </page>
                </contentDefinition>
            </contentDefinitions>
        </luceneSearch>
need some help
Thanks
Miyuru C. Ratnayake<p><font face=arial size=-1>Do you Yahoo!?<br>
<a href="http://us.rd.yahoo.com/mailtag_us/*http://mail.yahoo.com" target="_blank"><b>Yahoo! Mail</a></b> - More reliable, more storage, less spam