Hi,
I installed Lucene search module 1.5 successfully. I added the following tags to registry.xml, but .pdf and .doc files still don't get indexed. They are of type binary. Is there something wrong with the entries? .txt files are indexed and can be searched.
<luceneSearch>
  <mergeFactor>100000</mergeFactor>
  <permCheck>true</permCheck>
  <indexDir>/lucene/TBOKCMS/</indexDir>
  <analyzer>org.apache.lucene.analysis.standard.StandardAnalyzer</analyzer>
  <subsearch>true</subsearch>
  <project>online</project>
  <docFactories>
    <pageDocFactory enabled="true" type="page">
      <class>net.grcomputing.opencms.search.lucene.PageDocument</class>
    </pageDocFactory>
    <plainDocFactory enabled="true" type="plain">
      <fileType name="plaintext">
        <extension>.txt</extension>
        <class>net.grcomputing.opencms.search.lucene.PlainDocument</class>
      </fileType>
      <fileType name="taggedtext">
        <extension>.html</extension>
        <extension>.htm</extension>
        <extension>.xml</extension>
        <!-- This will strip tags before processing -->
        <class>net.grcomputing.opencms.search.lucene.TaggedPlainDocument</class>
      </fileType>
    </plainDocFactory>
    <jspDocFactory enabled="false" type="jsp">
      <class>net.grcomputing.opencms.search.lucene.JspDocument</class>
    </jspDocFactory>
    <docFactory enabled="true" type="news">
      <class>net.grcomputing.opencms.search.lucene.NewsDocument</class>
    </docFactory>
    <docFactory enabled="true" type="forum">
      <class>de.wfnetz.opencms.modules.forum.ContributionDocument</class>
    </docFactory>
    <docFactory enabled="false" type="XML Template"/>
    <docFactory enabled="true" type="binary">
      <fileType name="pdftext">
        <extension>.pdf</extension>
        <class>net.grcomputing.opencms.search.lucene.PDFDocument</class>
      </fileType>
      <fileType name="wordtext">
        <extension>.doc</extension>
        <class>net.grcomputing.opencms.search.lucene.WordDocument</class>
      </fileType>
    </docFactory>
  </docFactories>
  <directories>
    <directory location="/TBOKCMS/Documents/">
      <section>TBOK CMS</section>
      <subsearch>true</subsearch>
    </directory>
  </directories>
  <contentDefinitions>
    <contentDefinition type="news">
      <class>com.opencms.modules.homepage.news.NewsContentDefinition</class>
      <initClass>net.grcomputing.opencms.search.lucene.NewsInitialization</initClass>
      <listMethod name="getNewsList">
        <param type="java.lang.Integer">1</param>
        <param type="java.lang.String">-1</param>
      </listMethod>
      <page uri="/news.html?__element=entry">
        <param method="getIntId" name="newsid"/>
      </page>
    </contentDefinition>
    <contentDefinition type="forum">
      <class>de.wfnetz.opencms.modules.forum.ContributionContentDefinition</class>
      <listMethod name="getSortedList">
        <param type="java.lang.String"/>
      </listMethod>
      <page uri="/forum.html?forumtemplate=viewcontributionentry">
        <param method="getId" name="conid"/>
      </page>
    </contentDefinition>
  </contentDefinitions>
</luceneSearch>
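In case it helps anyone compare the entries: below is a minimal Python sketch that lists which extension is mapped to which document class in a fragment like the one above. It is not part of the Lucene module and says nothing about how the module itself reads the registry; it only checks that the fragment is well-formed XML and shows the mappings side by side. The function name extension_map and the trimmed SAMPLE string are made up for illustration.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the fragment above (illustration only, not the full registry).
SAMPLE = """
<luceneSearch>
  <docFactories>
    <plainDocFactory enabled="true" type="plain">
      <fileType name="plaintext">
        <extension>.txt</extension>
        <class>net.grcomputing.opencms.search.lucene.PlainDocument</class>
      </fileType>
    </plainDocFactory>
    <docFactory enabled="true" type="binary">
      <fileType name="pdftext">
        <extension>.pdf</extension>
        <class>net.grcomputing.opencms.search.lucene.PDFDocument</class>
      </fileType>
      <fileType name="wordtext">
        <extension>.doc</extension>
        <class>net.grcomputing.opencms.search.lucene.WordDocument</class>
      </fileType>
    </docFactory>
  </docFactories>
</luceneSearch>
"""


def extension_map(xml_text):
    """Return {extension: (factory type, enabled flag, class name)}.

    Raises xml.etree.ElementTree.ParseError if the text is not well-formed XML.
    """
    root = ET.fromstring(xml_text)
    mapping = {}
    factories = root.find("docFactories")
    if factories is None:
        return mapping
    # Every child of <docFactories> is treated as a factory, whatever its tag name.
    for factory in factories:
        for file_type in factory.findall("fileType"):
            class_name = file_type.findtext("class", default="").strip()
            for ext in file_type.findall("extension"):
                key = (ext.text or "").strip()
                mapping[key] = (factory.get("type"), factory.get("enabled"), class_name)
    return mapping


if __name__ == "__main__":
    for ext, (ftype, enabled, class_name) in sorted(extension_map(SAMPLE).items()):
        print(ext, ftype, enabled, class_name)
```

Running it against the full luceneSearch block should show .pdf and .doc under type binary next to .txt under type plain, which at least rules out a typo or a nesting mistake in the entries themselves.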
I need some help with this.
Thanks
Miyuru C. Ratnayake