<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META http-equiv=Content-Type content="text/html; charset=iso-8859-1">
<META content="MSHTML 6.00.2800.1276" name=GENERATOR></HEAD>
<BODY bgColor=#ffffff>
<DIV><FONT face=Arial size=2>I just wanted to make sure you use this format and
not the one for version 1.3 or 1.2. Of course you have to add the docFactory for
binary files as you posted it.</FONT></DIV>
<BLOCKQUOTE
style="PADDING-RIGHT: 0px; PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #000000 2px solid; MARGIN-RIGHT: 0px">
<DIV style="FONT: 10pt arial">----- Original Message ----- </DIV>
<DIV
style="BACKGROUND: #e4e4e4; FONT: 10pt arial; font-color: black"><B>From:</B>
<A title=dattaritwik@yahoo.com href="mailto:dattaritwik@yahoo.com">Ritwik
Datta</A> </DIV>
<DIV style="FONT: 10pt arial"><B>To:</B> <A title=opencms-dev@opencms.org
href="mailto:opencms-dev@opencms.org">opencms-dev@opencms.org</A> </DIV>
<DIV style="FONT: 10pt arial"><B>Sent:</B> Wednesday, January 21, 2004 10:54
AM</DIV>
<DIV style="FONT: 10pt arial"><B>Subject:</B> Re: [opencms-dev] Problem:
docFactory in registry.xml for lucene word .doc file search:urgent</DIV>
<DIV><BR></DIV>
<DIV>Ok Thanks. But tell me one thing, there is no entry for registering class
for word document and pdf document search in the link you have given. I
mean registry.xml should have enrty for .doc and .pdf extension,
right?<BR><BR><B><I>"Hartmann, Waehrisch & Feykes GmbH" <<A
href="mailto:hartmann@waehrisch-feykes.de">hartmann@waehrisch-feykes.de</A>></I></B>
wrote:
<BLOCKQUOTE class=replbq
style="PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #1010ff 2px solid">
<META content="MSHTML 6.00.2800.1276" name=GENERATOR>
<DIV><FONT face=Arial size=2>There has been a redesign from 1.3 to
(inofficial) 1.4. The cvs is based on this 1.4 and you have to make sure
that your registry looks like the sample registry <A
href="http://www.aleph-null.tv/downloads/contribs/beffe/registry.txt">http://www.aleph-null.tv/downloads/contribs/beffe/registry.txt</A></FONT></DIV>
<DIV><FONT face=Arial size=2>Also compile and copy all files from the cvs to
your classes folder.</FONT></DIV>
<DIV><FONT face=Arial size=2></FONT> </DIV>
<DIV><FONT face=Arial size=2>Bye,</FONT></DIV>
<DIV><FONT face=Arial size=2>Stephan</FONT></DIV>
<DIV> </DIV>
<BLOCKQUOTE
style="PADDING-RIGHT: 0px; PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #000000 2px solid; MARGIN-RIGHT: 0px">
<DIV style="FONT: 10pt arial">----- Original Message ----- </DIV>
<DIV
style="BACKGROUND: #e4e4e4; FONT: 10pt arial; font-color: black"><B>From:</B>
<A title=dattaritwik@yahoo.com href="mailto:dattaritwik@yahoo.com">Ritwik
Datta</A> </DIV>
<DIV style="FONT: 10pt arial"><B>To:</B> <A title=opencms-dev@opencms.org
href="mailto:opencms-dev@opencms.org">opencms-dev@opencms.org</A> </DIV>
<DIV style="FONT: 10pt arial"><B>Sent:</B> Wednesday, January 21, 2004
10:11 AM</DIV>
<DIV style="FONT: 10pt arial"><B>Subject:</B> Re: [opencms-dev] Problem:
docFactory in registry.xml for lucene word .doc file search:urgent</DIV>
<DIV><BR></DIV>
<DIV>Dear Stephan,</DIV>
<DIV> </DIV>
<DIV>I have imported net.grcomputing.opencms.search.lucene_1.3.zip. That
version was missing search index facility for Word and Pdf documents. So I
downleded those java files, compiled and copied under
$TOMCAT-HOME/webapps/opencms/WEB-INF/classes/net/grcomputing/opencms/search/lucene.</DIV>
<DIV>I restared tomcat after that. but no result. Pls help me.</DIV>
<DIV>regards,</DIV>
<DIV>Ritwik</DIV>
<DIV> </DIV>
<DIV> </DIV>
<DIV> </DIV>
<DIV><B><I>"Hartmann, Waehrisch & Feykes GmbH"
<hartmann@waehrisch-feykes.de></I></B> wrote:</DIV>
<BLOCKQUOTE class=replbq
style="PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #1010ff 2px solid">
<META content="MSHTML 6.00.2800.1276" name=GENERATOR>
<STYLE></STYLE>
<DIV><FONT face=Arial size=2>Which version of the module did you use
before? Did you copy only those to classes or all together?</FONT><FONT
face=Arial size=2> Did you restart tomcat?</FONT></DIV>
<DIV><FONT face=Arial size=2></FONT> </DIV>
<DIV><FONT face=Arial size=2>Regards,</FONT></DIV>
<DIV><FONT face=Arial size=2>Stephan</FONT></DIV>
<BLOCKQUOTE
style="PADDING-RIGHT: 0px; PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #000000 2px solid; MARGIN-RIGHT: 0px">
<DIV style="FONT: 10pt arial">----- Original Message ----- </DIV>
<DIV
style="BACKGROUND: #e4e4e4; FONT: 10pt arial; font-color: black"><B>From:</B>
<A title=dattaritwik@yahoo.com
href="mailto:dattaritwik@yahoo.com">Ritwik Datta</A> </DIV>
<DIV style="FONT: 10pt arial"><B>To:</B> <A
title=opencms-dev@opencms.org
href="mailto:opencms-dev@opencms.org">opencms-dev@opencms.org</A>
</DIV>
<DIV style="FONT: 10pt arial"><B>Sent:</B> Wednesday, January 21, 2004
7:16 AM</DIV>
<DIV style="FONT: 10pt arial"><B>Subject:</B> [opencms-dev] Problem:
docFactory in registry.xml for lucene word .doc file
search:urgent</DIV>
<DIV><BR></DIV>
<P>Dear All,</P>
<P>I have complied opencms lucene source from CVS repositories. I have
got WordDocument.class and I_Documentfactory.class under
net.grcomputing.opencms.search.lucene package. Now I uploaded those
files under
$TOMCAT-HOME/webapps/opencms/WEB-INF/classes/net/grcomputing/opencms/search/lucene.
I also uploaded third party tm-extractors-0.2.jar under
$TOMCAT-HOME/webapps/opencms/WEB-INF/lib/</P>
<P>Now I have changed <docFactory> in registry.xml for lucene
word .doc file search. Here is segment of registry.xml </P>
<P><docFactories>......</P>
<P><docFactory type="binary"
enabled="true"><BR> <fileType
name="doctext"><BR> <extension>.doc</extension><BR> <extension>.dot</extension><BR> <class>net.grcomputing.opencms.search.lucene.WordDocument</class><BR> </fileType><BR> </docFactory></P>
<P>..........</P>
<P></docFactories>.</P>
<P>Mow when I run crond scheduler, indexing is successful but there is
no trace of indexing my doc files. I also checked it from
simple_search.jsp. It is unable to hit url of my word docs even search
criteria is met. I am attaching logs of index manager. There is trace
of loading Page DocumentFactory, JSP DocumentFactory, Plain
DocumentFactory, But not my word Document factory. I think I am
missing something. can anyone tell me the catch? It is pretty urgent.
pls help me</P>
<P>=====IndexManager=============================================================<BR>[21.01.2004
11:36:10] <opencms_info> Analyzer:
org.apache.lucene.analysis.standard.StandardAnalyzer<BR>[21.01.2004
11:36:10] <opencms_info> Page DocumentFactory
loaded<BR>[21.01.2004 11:36:10] <opencms_info> JSP
DocumentFactory loaded<BR>[21.01.2004 11:36:10] <opencms_info>
Plain DocumentFactory loaded<BR>[21.01.2004 11:36:10]
<opencms_info> Extension map exists to handle
plaintext<BR>[21.01.2004 11:36:10] <opencms_info> Extension map
exists to handle taggedtext<BR>[21.01.2004 11:36:10]
<opencms_info> IndexManager: indexing /release/<BR>[21.01.2004
11:36:10] <opencms_info> IndexManager: indexing
/release/spdb/<BR>[21.01.2004 11:36:10] <opencms_info>
IndexManager: indexing
/release/spdb/Assessment_Findings/<BR>[21.01.2004 11:36:10]
<opencms_info> IndexManager: indexing
/release/spdb/Best_Practices/<BR>[21.01.2004 11:36:11]
<opencms_info> IndexManager: indexing
/release/spdb/Business_Goals/<BR>[21.01.2004 11:36:11]
<opencms_info> IndexManager: indexing
/release/spdb/CMC_Product_Information/<BR>[21.01.2004 11:36:11]
<opencms_info> IndexManager: indexing
/release/spdb/CMM_Action_Plans/<BR>[21.01.2004 11:36:11]
<opencms_info> IndexManager: indexing
/release/spdb/Coding_Standard/<BR>[21.01.2004 11:36:11]
<opencms_info> IndexManager: indexing
/release/spdb/Dashboard/<BR>[21.01.2004 11:36:11] <opencms_info>
IndexManager: indexing /release/spdb/Defect_Prevention/<BR>[21.01.2004
11:36:11] <opencms_info> IndexManager: indexing
/release/spdb/ER_SI_Organisation_Structure/<BR>[21.01.2004 11:36:11]
<opencms_info> IndexManager: indexing
/release/spdb/Estimation/<BR>[21.01.2004 11:36:11]
<opencms_info> IndexManager: indexing
/release/spdb/Expert_List/<BR>[21.01.2004 11:36:11]
<opencms_info> IndexManager: indexing
/release/spdb/FAQ/<BR>[21.01.2004 11: 36:12] <opencms_info>
IndexManager: indexing
/release/spdb/IGC_OSSP_Role_Mapping/<BR>[21.01.2004 11:36:12]
<opencms_info> IndexManager: indexing
/release/spdb/Metrics_and_Measurements/<BR>[21.01.2004 11:36:12]
<opencms_info> IndexManager: indexing
/release/spdb/OQPM/<BR>[21.01.2004 11:36:12] <opencms_info>
IndexManager: indexing /release/spdb/OSSP/<BR>[21.01.2004 11:36:12]
<opencms_info> IndexManager: indexing
/release/spdb/Presentation_Library/<BR>[21.01.2004 11:36:12]
<opencms_info> IndexManager: indexing
/release/spdb/Process_Change_Management/<BR>[21.01.2004 11:36:12]
<opencms_info> IndexManager: indexing
/release/spdb/Projectwise_Plans/<BR>[21.01.2004 11:36:12]
<opencms_info> IndexManager: indexing
/release/spdb/PROMPT/<BR>[21.01.2004 11:36:12] <opencms_info>
IndexManager: indexing /release/spdb/Readables/<BR>[21.01.2004
11:36:12] <opencms_info> IndexManager: indexing
/release/spdb/Sample_CMM_Documents/<BR>[21.01.2004 11:36:13]
<opencms_info> IndexManager: indexing
/release/spdb/SCM/<BR>[21.01.2004 11:36:13] <opencms_info>
IndexManager: indexing /release/spdb/SEPG/<BR>[21.01.2004 11:36:13]
<opencms_info> IndexManager: indexing
/release/spdb/SPDB_Notes/<BR>[21.01.2004 11:36:13]
<opencms_info> IndexManager: indexing
/release/spdb/SPDB_Search/<BR>[21.01.2004 11:36:13]
<opencms_info> IndexManager: indexing
/release/spdb/SQA/<BR>[21.01.2004 11:36:13] <opencms_info>
IndexManager: indexing /release/spdb/TCM/<BR>[21.01.2004 11:36:13]
<opencms_info> IndexManager: indexing
/release/spdb/TCM/Notes/<BR>[21.01.2004 11:36:13] <opencms_info>
IndexManager: indexing /release/spdb/TCM/Others/<BR>[21.01.2004
11:36:13] <opencms_info> IndexManager: indexing
/release/spdb/TCM/Reusable_Assets/<BR>[21.01.2004 11:36:14]
<opencms_info> IndexManager: indexing
/release/spdb/TCM/Reusable_Assets/Asset_Data/<BR>[21.01.2004 11:36:14]
<opencms_info> IndexM anager: indexing
/release/spdb/TCM/Reusable_Assets/Asset_Details/<BR>[21.01.2004
11:36:14] <opencms_info> IndexManager: indexing
/release/spdb/TCM/Reusable_Assets/Asset_Details/Bilingual_2-tier_Application_to_3-tier_Conversion/<BR>[21.01.2004
11:36:14] <opencms_info> IndexManager: indexing
/release/spdb/TCM/Reusable_Assets/Asset_Details/Citrix/<BR>[21.01.2004
11:36:14] <opencms_info> IndexManager: indexing
/release/spdb/TCM/Reusable_Assets/Asset_Details/Compilation_Problem/<BR>[21.01.2004
11:36:14] <opencms_info> IndexManager: indexing
/release/spdb/TCM/Reusable_Assets/Asset_Details/Driver_Installation/<BR>[21.01.2004
11:36:14] <opencms_info> IndexManager: indexing
/release/spdb/TCM/Reusable_Assets/Asset_Details/FTP_Service_on_Linux/<BR>[21.01.2004
11:36:14] <opencms_info> IndexManager: indexing
/release/spdb/TCM/Reusable_Assets/Asset_Details/Hindi_Email/<BR>[21.01.2004
11:36:14] <opencms_info> IndexManager: indexing
/release/spdb/TCM/Reusable_Assets/Asset_Details/Hindi_Integration_Development_Guidelines/<BR>[21.01.2004
11:36:15] <opencms_info> IndexManager: indexing
/release/spdb/TCM/Reusable_Assets/Asset_Details/HW_Requirement_for_Oracle9i_9iDS_9iASR2/<BR>[21.01.2004
11:36:15] <opencms_info> IndexManager: indexing
/release/spdb/TCM/Reusable_Assets/Asset_Details/Oracle_9i_Application_Server_Release2_Installation/<BR>[21.01.2004
11:36:15] <opencms_info> IndexManager: indexing
/release/spdb/TCM/Reusable_Assets/Asset_Details/Oracle_Forms9i_to_Forms6i_Conversion/<BR>[21.01.2004
11:36:15] <opencms_info> IndexManager: indexing
/release/spdb/TCM/Reusable_Assets/Asset_Details/Oracle_Froms6i_Deployment_on_9iAS/<BR>[21.01.2004
11:36:15] <opencms_info> IndexManager: indexing
/release/spdb/TCM/Reusable_Assets/Asset_Details/ORARRP_Reusable_Components/<BR>[21.01.2004
11:36:15] <opencms_info> IndexManager: indexing
/release/spdb/TCM/Reusable_Assets/Asset_Details/OS_Problem/<BR>[21.01.2004
11:36:15] <opencms_info> IndexManager: indexing
/release/spdb/TCM/Reusable_Assets/Asset_Details/Red_Hat_Advance_Server_Installation/<BR>[21.01.2004
11:36:15] <opencms_info> IndexManager: indexing
/release/spdb/TCM/Reusable_Assets/Project_Info/<BR>[21.01.2004
11:36:16] <opencms_info> IndexManager: indexing
/release/spdb/TCM/Reusable_Assets/Register/<BR>[21.01.2004 11:36:16]
<opencms_info> IndexManager: indexing
/release/spdb/TCM/Reusable_Assets/Training_Materials/<BR>[21.01.2004
11:36:16] <opencms_info> IndexManager: indexing
/release/spdb/TCM/TCM_Plans/<BR>[21.01.2004 11:36:16]
<opencms_info> IndexManager: indexing
/release/spdb/TCM/Templates/<BR>[21.01.2004 11:36:16]
<opencms_info> IndexManager: indexing
/release/spdb/Timesheet/<BR>[21.01.2004 11:36:16] <opencms_info>
IndexManager: indexing /release/spdb/Training/<BR>[21.01.2004
11:36:17] <opencms_in fo> IndexManager: 55 documents are being
processed<BR>[21.01.2004 11:36:17] <opencms_info>
IndexManager: Index has been optimized.<BR>[21.01.2004 11:36:17]
<opencms_info>
Done<BR>=====IndexManager=============================================================<BR>[21.01.2004
11:36:17] <opencms_cronscheduler> Successful launch of job
com.opencms.core.CmsCronEntry{36 11 * * * Admin Administrators
net.grcomputing.opencms.search.lucene.CronIndexManager
createIndex=true} Message: CronIndexManager rebuilt the Lucene index
on Wed Jan 21 11:36:17 IST 2004</P>
<P>Regards,</P>
<P>Ritwik<BR></P>
<P>
<HR SIZE=1>
Do you Yahoo!?<BR>Yahoo! Hotjobs: <A
href="http://pa.yahoo.com/*http://us.rd.yahoo.com/hotjobs/mail_footer_email/evt=21482/*http://hotjobs.sweepstakes.yahoo.com/signingbonus">Enter
the "Signing Bonus" Sweepstakes</A></BLOCKQUOTE></BLOCKQUOTE>
<P>
<HR SIZE=1>
Do you Yahoo!?<BR>Yahoo! Hotjobs: <A
href="http://pa.yahoo.com/*http://us.rd.yahoo.com/hotjobs/mail_footer_email/evt=21482/*http://hotjobs.sweepstakes.yahoo.com/signingbonus">Enter
the "Signing Bonus" Sweepstakes</A></BLOCKQUOTE></BLOCKQUOTE></DIV>
<P>
<HR SIZE=1>
Do you Yahoo!?<BR>Yahoo! Hotjobs: <A
href="http://pa.yahoo.com/*http://us.rd.yahoo.com/hotjobs/mail_footer_email/evt=21482/*http://hotjobs.sweepstakes.yahoo.com/signingbonus">Enter
the "Signing Bonus" Sweepstakes</A></BLOCKQUOTE></BODY></HTML>