<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META http-equiv=Content-Type content="text/html; charset=iso-8859-1">
<META content="MSHTML 6.00.2800.1276" name=GENERATOR></HEAD>
<BODY bgColor=#ffffff>
<DIV><FONT face=Arial size=2>I just wanted to make sure you use this format and 
not the one for version 1.3 or 1.2. Of course you have to add the docFactory for 
binary files as you posted it.</FONT></DIV>
<BLOCKQUOTE 
style="PADDING-RIGHT: 0px; PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #000000 2px solid; MARGIN-RIGHT: 0px">
  <DIV style="FONT: 10pt arial">----- Original Message ----- </DIV>
  <DIV 
  style="BACKGROUND: #e4e4e4; FONT: 10pt arial; font-color: black"><B>From:</B> 
  <A title=dattaritwik@yahoo.com href="mailto:dattaritwik@yahoo.com">Ritwik 
  Datta</A> </DIV>
  <DIV style="FONT: 10pt arial"><B>To:</B> <A title=opencms-dev@opencms.org 
  href="mailto:opencms-dev@opencms.org">opencms-dev@opencms.org</A> </DIV>
  <DIV style="FONT: 10pt arial"><B>Sent:</B> Wednesday, January 21, 2004 10:54 
  AM</DIV>
  <DIV style="FONT: 10pt arial"><B>Subject:</B> Re: [opencms-dev] Problem: 
  docFactory in registry.xml for lucene word .doc file search:urgent</DIV>
  <DIV><BR></DIV>
  <DIV>Ok Thanks. But tell me one thing, there is no entry for registering class 
  for word document and pdf document search in the link you have given. I 
  mean registry.xml should have enrty for .doc and .pdf extension, 
  right?<BR><BR><B><I>"Hartmann, Waehrisch & Feykes GmbH" <<A 
  href="mailto:hartmann@waehrisch-feykes.de">hartmann@waehrisch-feykes.de</A>></I></B> 
  wrote: 
  <BLOCKQUOTE class=replbq 
  style="PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #1010ff 2px solid">
    <META content="MSHTML 6.00.2800.1276" name=GENERATOR>
    <DIV><FONT face=Arial size=2>There has been a redesign from 1.3 to 
    (inofficial) 1.4. The cvs is based on this 1.4 and you have to make sure 
    that your registry looks like the sample registry <A 
    href="http://www.aleph-null.tv/downloads/contribs/beffe/registry.txt">http://www.aleph-null.tv/downloads/contribs/beffe/registry.txt</A></FONT></DIV>
    <DIV><FONT face=Arial size=2>Also compile and copy all files from the cvs to 
    your classes folder.</FONT></DIV>
    <DIV><FONT face=Arial size=2></FONT> </DIV>
    <DIV><FONT face=Arial size=2>Bye,</FONT></DIV>
    <DIV><FONT face=Arial size=2>Stephan</FONT></DIV>
    <DIV> </DIV>
    <BLOCKQUOTE 
    style="PADDING-RIGHT: 0px; PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #000000 2px solid; MARGIN-RIGHT: 0px">
      <DIV style="FONT: 10pt arial">----- Original Message ----- </DIV>
      <DIV 
      style="BACKGROUND: #e4e4e4; FONT: 10pt arial; font-color: black"><B>From:</B> 
      <A title=dattaritwik@yahoo.com href="mailto:dattaritwik@yahoo.com">Ritwik 
      Datta</A> </DIV>
      <DIV style="FONT: 10pt arial"><B>To:</B> <A title=opencms-dev@opencms.org 
      href="mailto:opencms-dev@opencms.org">opencms-dev@opencms.org</A> </DIV>
      <DIV style="FONT: 10pt arial"><B>Sent:</B> Wednesday, January 21, 2004 
      10:11 AM</DIV>
      <DIV style="FONT: 10pt arial"><B>Subject:</B> Re: [opencms-dev] Problem: 
      docFactory in registry.xml for lucene word .doc file search:urgent</DIV>
      <DIV><BR></DIV>
      <DIV>Dear Stephan,</DIV>
      <DIV> </DIV>
      <DIV>I have imported net.grcomputing.opencms.search.lucene_1.3.zip. That 
      version was missing search index facility for Word and Pdf documents. So I 
      downleded those java files, compiled and copied under 
      $TOMCAT-HOME/webapps/opencms/WEB-INF/classes/net/grcomputing/opencms/search/lucene.</DIV>
      <DIV>I restared tomcat after that. but no result. Pls help me.</DIV>
      <DIV>regards,</DIV>
      <DIV>Ritwik</DIV>
      <DIV> </DIV>
      <DIV> </DIV>
      <DIV> </DIV>
      <DIV><B><I>"Hartmann, Waehrisch & Feykes GmbH" 
      <hartmann@waehrisch-feykes.de></I></B> wrote:</DIV>
      <BLOCKQUOTE class=replbq 
      style="PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #1010ff 2px solid">
        <META content="MSHTML 6.00.2800.1276" name=GENERATOR>
        <STYLE></STYLE>

        <DIV><FONT face=Arial size=2>Which version of the module did you use 
        before? Did you copy only those to classes or all together?</FONT><FONT 
        face=Arial size=2> Did you restart tomcat?</FONT></DIV>
        <DIV><FONT face=Arial size=2></FONT> </DIV>
        <DIV><FONT face=Arial size=2>Regards,</FONT></DIV>
        <DIV><FONT face=Arial size=2>Stephan</FONT></DIV>
        <BLOCKQUOTE 
        style="PADDING-RIGHT: 0px; PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #000000 2px solid; MARGIN-RIGHT: 0px">
          <DIV style="FONT: 10pt arial">----- Original Message ----- </DIV>
          <DIV 
          style="BACKGROUND: #e4e4e4; FONT: 10pt arial; font-color: black"><B>From:</B> 
          <A title=dattaritwik@yahoo.com 
          href="mailto:dattaritwik@yahoo.com">Ritwik Datta</A> </DIV>
          <DIV style="FONT: 10pt arial"><B>To:</B> <A 
          title=opencms-dev@opencms.org 
          href="mailto:opencms-dev@opencms.org">opencms-dev@opencms.org</A> 
          </DIV>
          <DIV style="FONT: 10pt arial"><B>Sent:</B> Wednesday, January 21, 2004 
          7:16 AM</DIV>
          <DIV style="FONT: 10pt arial"><B>Subject:</B> [opencms-dev] Problem: 
          docFactory in registry.xml for lucene word .doc file 
          search:urgent</DIV>
          <DIV><BR></DIV>
          <P>Dear All,</P>
          <P>I have complied opencms lucene source from CVS repositories. I have 
          got WordDocument.class and I_Documentfactory.class under 
          net.grcomputing.opencms.search.lucene package. Now I uploaded those 
          files under 
          $TOMCAT-HOME/webapps/opencms/WEB-INF/classes/net/grcomputing/opencms/search/lucene. 
          I also uploaded third party tm-extractors-0.2.jar under 
          $TOMCAT-HOME/webapps/opencms/WEB-INF/lib/</P>
          <P>Now I have changed <docFactory> in registry.xml for lucene 
          word .doc file search. Here is segment of registry.xml </P>
          <P><docFactories>......</P>
          <P><docFactory type="binary" 
          enabled="true"><BR>     <fileType 
          name="doctext"><BR>      <extension>.doc</extension><BR>      <extension>.dot</extension><BR>      <class>net.grcomputing.opencms.search.lucene.WordDocument</class><BR>     </fileType><BR>    </docFactory></P>
          <P>..........</P>
          <P></docFactories>.</P>
          <P>Mow when I run crond scheduler, indexing is successful but there is 
          no trace of indexing my doc files. I also checked it from 
          simple_search.jsp. It is unable to hit url of my word docs even search 
          criteria is met. I am attaching logs of index manager. There is trace 
          of loading Page DocumentFactory, JSP DocumentFactory, Plain 
          DocumentFactory, But not my word Document factory. I think I am 
          missing something. can anyone tell me the catch? It is pretty urgent. 
          pls help me</P>
          <P>=====IndexManager=============================================================<BR>[21.01.2004 
          11:36:10] <opencms_info> Analyzer: 
          org.apache.lucene.analysis.standard.StandardAnalyzer<BR>[21.01.2004 
          11:36:10] <opencms_info> Page DocumentFactory 
          loaded<BR>[21.01.2004 11:36:10] <opencms_info> JSP 
          DocumentFactory loaded<BR>[21.01.2004 11:36:10] <opencms_info> 
          Plain DocumentFactory loaded<BR>[21.01.2004 11:36:10] 
          <opencms_info> Extension map exists to handle 
          plaintext<BR>[21.01.2004 11:36:10] <opencms_info> Extension map 
          exists to handle taggedtext<BR>[21.01.2004 11:36:10] 
          <opencms_info> IndexManager: indexing /release/<BR>[21.01.2004 
          11:36:10] <opencms_info> IndexManager: indexing 
          /release/spdb/<BR>[21.01.2004 11:36:10] <opencms_info> 
          IndexManager: indexing 
          /release/spdb/Assessment_Findings/<BR>[21.01.2004 11:36:10] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/Best_Practices/<BR>[21.01.2004 11:36:11] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/Business_Goals/<BR>[21.01.2004 11:36:11] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/CMC_Product_Information/<BR>[21.01.2004 11:36:11] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/CMM_Action_Plans/<BR>[21.01.2004 11:36:11] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/Coding_Standard/<BR>[21.01.2004 11:36:11] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/Dashboard/<BR>[21.01.2004 11:36:11] <opencms_info> 
          IndexManager: indexing /release/spdb/Defect_Prevention/<BR>[21.01.2004 
          11:36:11] <opencms_info> IndexManager: indexing 
          /release/spdb/ER_SI_Organisation_Structure/<BR>[21.01.2004 11:36:11] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/Estimation/<BR>[21.01.2004 11:36:11] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/Expert_List/<BR>[21.01.2004 11:36:11] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/FAQ/<BR>[21.01.2004 11: 36:12] <opencms_info> 
          IndexManager: indexing 
          /release/spdb/IGC_OSSP_Role_Mapping/<BR>[21.01.2004 11:36:12] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/Metrics_and_Measurements/<BR>[21.01.2004 11:36:12] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/OQPM/<BR>[21.01.2004 11:36:12] <opencms_info> 
          IndexManager: indexing /release/spdb/OSSP/<BR>[21.01.2004 11:36:12] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/Presentation_Library/<BR>[21.01.2004 11:36:12] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/Process_Change_Management/<BR>[21.01.2004 11:36:12] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/Projectwise_Plans/<BR>[21.01.2004 11:36:12] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/PROMPT/<BR>[21.01.2004 11:36:12] <opencms_info> 
          IndexManager: indexing /release/spdb/Readables/<BR>[21.01.2004 
          11:36:12] <opencms_info> IndexManager: indexing 
          /release/spdb/Sample_CMM_Documents/<BR>[21.01.2004 11:36:13] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/SCM/<BR>[21.01.2004 11:36:13] <opencms_info> 
          IndexManager: indexing /release/spdb/SEPG/<BR>[21.01.2004 11:36:13] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/SPDB_Notes/<BR>[21.01.2004 11:36:13] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/SPDB_Search/<BR>[21.01.2004 11:36:13] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/SQA/<BR>[21.01.2004 11:36:13] <opencms_info> 
          IndexManager: indexing /release/spdb/TCM/<BR>[21.01.2004 11:36:13] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/TCM/Notes/<BR>[21.01.2004 11:36:13] <opencms_info> 
          IndexManager: indexing /release/spdb/TCM/Others/<BR>[21.01.2004 
          11:36:13] <opencms_info> IndexManager: indexing 
          /release/spdb/TCM/Reusable_Assets/<BR>[21.01.2004 11:36:14] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/TCM/Reusable_Assets/Asset_Data/<BR>[21.01.2004 11:36:14] 
          <opencms_info> IndexM anager: indexing 
          /release/spdb/TCM/Reusable_Assets/Asset_Details/<BR>[21.01.2004 
          11:36:14] <opencms_info> IndexManager: indexing 
          /release/spdb/TCM/Reusable_Assets/Asset_Details/Bilingual_2-tier_Application_to_3-tier_Conversion/<BR>[21.01.2004 
          11:36:14] <opencms_info> IndexManager: indexing 
          /release/spdb/TCM/Reusable_Assets/Asset_Details/Citrix/<BR>[21.01.2004 
          11:36:14] <opencms_info> IndexManager: indexing 
          /release/spdb/TCM/Reusable_Assets/Asset_Details/Compilation_Problem/<BR>[21.01.2004 
          11:36:14] <opencms_info> IndexManager: indexing 
          /release/spdb/TCM/Reusable_Assets/Asset_Details/Driver_Installation/<BR>[21.01.2004 
          11:36:14] <opencms_info> IndexManager: indexing 
          /release/spdb/TCM/Reusable_Assets/Asset_Details/FTP_Service_on_Linux/<BR>[21.01.2004 
          11:36:14] <opencms_info> IndexManager: indexing 
          /release/spdb/TCM/Reusable_Assets/Asset_Details/Hindi_Email/<BR>[21.01.2004 
          11:36:14] <opencms_info> IndexManager: indexing 
          /release/spdb/TCM/Reusable_Assets/Asset_Details/Hindi_Integration_Development_Guidelines/<BR>[21.01.2004 
          11:36:15] <opencms_info> IndexManager: indexing 
          /release/spdb/TCM/Reusable_Assets/Asset_Details/HW_Requirement_for_Oracle9i_9iDS_9iASR2/<BR>[21.01.2004 
          11:36:15] <opencms_info> IndexManager: indexing 
          /release/spdb/TCM/Reusable_Assets/Asset_Details/Oracle_9i_Application_Server_Release2_Installation/<BR>[21.01.2004 
          11:36:15] <opencms_info> IndexManager: indexing 
          /release/spdb/TCM/Reusable_Assets/Asset_Details/Oracle_Forms9i_to_Forms6i_Conversion/<BR>[21.01.2004 
          11:36:15] <opencms_info> IndexManager: indexing 
          /release/spdb/TCM/Reusable_Assets/Asset_Details/Oracle_Froms6i_Deployment_on_9iAS/<BR>[21.01.2004 
          11:36:15] <opencms_info> IndexManager: indexing 
          /release/spdb/TCM/Reusable_Assets/Asset_Details/ORARRP_Reusable_Components/<BR>[21.01.2004 
          11:36:15] <opencms_info> IndexManager: indexing 
          /release/spdb/TCM/Reusable_Assets/Asset_Details/OS_Problem/<BR>[21.01.2004 
          11:36:15] <opencms_info> IndexManager: indexing 
          /release/spdb/TCM/Reusable_Assets/Asset_Details/Red_Hat_Advance_Server_Installation/<BR>[21.01.2004 
          11:36:15] <opencms_info> IndexManager: indexing 
          /release/spdb/TCM/Reusable_Assets/Project_Info/<BR>[21.01.2004 
          11:36:16] <opencms_info> IndexManager: indexing 
          /release/spdb/TCM/Reusable_Assets/Register/<BR>[21.01.2004 11:36:16] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/TCM/Reusable_Assets/Training_Materials/<BR>[21.01.2004 
          11:36:16] <opencms_info> IndexManager: indexing 
          /release/spdb/TCM/TCM_Plans/<BR>[21.01.2004 11:36:16] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/TCM/Templates/<BR>[21.01.2004 11:36:16] 
          <opencms_info> IndexManager: indexing 
          /release/spdb/Timesheet/<BR>[21.01.2004 11:36:16] <opencms_info> 
          IndexManager: indexing /release/spdb/Training/<BR>[21.01.2004 
          11:36:17] <opencms_in fo> IndexManager: 55 documents are being 
          processed<BR>[21.01.2004 11:36:17] <opencms_info> 
          IndexManager:  Index has been optimized.<BR>[21.01.2004 11:36:17] 
          <opencms_info> 
          Done<BR>=====IndexManager=============================================================<BR>[21.01.2004 
          11:36:17] <opencms_cronscheduler> Successful launch of job 
          com.opencms.core.CmsCronEntry{36 11 * * * Admin Administrators 
          net.grcomputing.opencms.search.lucene.CronIndexManager 
          createIndex=true} Message: CronIndexManager rebuilt the Lucene index 
          on Wed Jan 21 11:36:17 IST 2004</P>
          <P>Regards,</P>
          <P>Ritwik<BR></P>
          <P>
          <HR SIZE=1>
          Do you Yahoo!?<BR>Yahoo! Hotjobs: <A 
          href="http://pa.yahoo.com/*http://us.rd.yahoo.com/hotjobs/mail_footer_email/evt=21482/*http://hotjobs.sweepstakes.yahoo.com/signingbonus">Enter 
          the "Signing Bonus" Sweepstakes</A></BLOCKQUOTE></BLOCKQUOTE>
      <P>
      <HR SIZE=1>
      Do you Yahoo!?<BR>Yahoo! Hotjobs: <A 
      href="http://pa.yahoo.com/*http://us.rd.yahoo.com/hotjobs/mail_footer_email/evt=21482/*http://hotjobs.sweepstakes.yahoo.com/signingbonus">Enter 
      the "Signing Bonus" Sweepstakes</A></BLOCKQUOTE></BLOCKQUOTE></DIV>
  <P>
  <HR SIZE=1>
  Do you Yahoo!?<BR>Yahoo! Hotjobs: <A 
  href="http://pa.yahoo.com/*http://us.rd.yahoo.com/hotjobs/mail_footer_email/evt=21482/*http://hotjobs.sweepstakes.yahoo.com/signingbonus">Enter 
  the "Signing Bonus" Sweepstakes</A></BLOCKQUOTE></BODY></HTML>