<html>
<body>
Thanks a lot Olli. Finally got it to work.<br><br>
Cheers,<br>
Kelvin<br><br>
<blockquote type=cite class=cite cite=""><font face="Times New Roman, Times" color="#FF0000">Here
is a resend, since I seem to have left the crucial word ‘off’ out of my
previous mail :-)</font> The
configuration below should work. Your original configuration does not
work because the cron job tries to index the content definitions
listed in the configuration, which your site most likely doesn’t have.<br>
<font face="Times New Roman, Times" color="#FF0000"> <br>
Olli<br>
</font><font face="Times New Roman, Times"> <br>
<?xml version="1.0" encoding="ISO-8859-1"?><br>
<registry><br>
  <system><br>
    <luceneSearch><br>
      <!--<br>
        - mergeFactor and permCheck are currently ignored.<br>
        --><br>
      <mergeFactor>100000</mergeFactor><br>
      <permCheck>true</permCheck><br>
      <!--<br>
        - directory in which lucene will store its indexes. Note: this is real<br>
        - fs, not VFS.<br>
        --><br>
      <indexDir>C:\luceneindex</indexDir><br>
      <!-- <indexDir>F:\luceneindex\</indexDir> --><br>
      <!--<br>
        - The analyzer is used for parsing documents. Choose one for your<br>
        - language. If language is English, use the StandardAnalyzer.<br>
        - There are additional analyzers at <a href="http://jakarta.apache.org/lucene" eudora="autourl">http://jakarta.apache.org/lucene</a><br>
        --><br>
      <analyzer>org.apache.lucene.analysis.standard.StandardAnalyzer</analyzer><br>
      <!-- <analyzer>org.apache.lucene.analysis.de.GermanAnalyzer</analyzer> --><br>
      <!--<br>
        - If subsearch is true, subfolders will be searched by default.<br>
        - This can be turned on/off per directory.<br>
        --><br>
      <subsearch>true</subsearch><br>
      <!--<br>
        - Name of the project to index. Online is recommended.<br>
        --><br>
      <project>online</project><br>
      <!--<br>
        - docFactories determine how documents are processed. Generally, one<br>
        - docFactory exists for each type of content (viz. JSP, Page, Plain)<br>
        - that you want to index.<br>
        --><br>
      <docFactories><br>
        <!--<br>
          - This docFactory indexes documents with type page (e.g. HTML<br>
          - files edited with the WYSIWYG editor).<br>
          --><br>
        <docFactory enabled="true" type="page"><br>
          <class>net.grcomputing.opencms.search.lucene.PageDocument</class><br>
        </docFactory><br>
        <!--<br>
          - This docFactory is a little more complex. It takes documents of<br>
          - type "plain" and determines, by extension, what class should be<br>
          - used to index each particular file. In this example, we want to<br>
          - index plain text files exactly as they are, but any files that<br>
          - contain tags need the tags stripped out before they are indexed.<br>
          -<br>
          - Note that the name="" attribute is simply for pretty output, and<br>
          - can contain any allowable PCDATA text.<br>
          --><br>
        <docFactory enabled="true" type="plain"><br>
          <fileType name="plaintext"><br>
            <extension>.txt</extension><br>
            <class>net.grcomputing.opencms.search.lucene.PlainDocument</class><br>
          </fileType><br>
          <fileType name="taggedtext"><br>
            <extension>.html</extension><br>
            <extension>.htm</extension><br>
            <extension>.xml</extension><br>
            <!-- This will strip tags before processing --><br>
            <class>net.grcomputing.opencms.search.lucene.TaggedPlainDocument</class><br>
          </fileType><br>
        </docFactory><br>
        <!--<br>
          - This will strip JSP tags and all scriptlets. IT WILL NOT RENDER THE<br>
          - JSP FIRST, as JSPs are, by nature, dynamic.<br>
          -<br>
          - Usually, this is off by default.<br>
          --><br>
        <docFactory enabled="false" type="jsp"><br>
          <class>net.grcomputing.opencms.search.lucene.JspDocument</class><br>
        </docFactory><br>
        <!-- For the news module. Enable if you use news --><br>
        <docFactory enabled="false" type="news"><br>
          <class>net.grcomputing.opencms.search.lucene.NewsDocument</class><br>
        </docFactory><br>
        <!-- For the forum module. Enable if you use forums. --><br>
        <docFactory enabled="false" type="forum"><br>
          <class>de.wfnetz.opencms.modules.forum.ContributionDocument</class><br>
        </docFactory><br>
        <!-- If you need to index XML Template files (bad idea) use this: --><br>
        <docFactory enabled="false" type="XML Template" /><br>
      </docFactories><br>
      <!--<br>
        - <directories/> determines which directories are indexed. By default,<br>
        - the /system directory is never indexed, so it is safe to index root.<br>
        -<br>
        - If you want to specify only certain directories for indexing, create<br>
        - one <directory/> entry per directory. Again, you may use subsearch to<br>
        - override the default subsearch setting discussed above.<br>
        --><br>
      <directories><br>
        <directory location="/"><br>
          <section>Root</section><br>
          <subsearch>true</subsearch><br>
        </directory><br>
      </directories><br>
      <!--<br>
        - Use this section to define specific contentDefinitions. Provided below<br>
        - are entries for the news and forum modules.<br>
        --><br>
      <contentDefinitions/><br>
    </luceneSearch><br>
    <!--<br>
      - END lucene config<br>
      --><br>
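<br>
A minimal sketch of the per-directory indexing described in the directories comment above, reusing only elements that already appear in this configuration. The locations /products/ and /press/, their section names, and the trailing-slash path format are hypothetical placeholders, not taken from any real site. That the second entry skips its subfolders is an assumption based on the comment's statement that subsearch can override the default per directory:<br>
<br>
<directories><br>
  <!-- hypothetical folder: indexed together with its subfolders --><br>
  <directory location="/products/"><br>
    <section>Products</section><br>
    <subsearch>true</subsearch><br>
  </directory><br>
  <!-- hypothetical folder: default subsearch overridden, subfolders skipped --><br>
  <directory location="/press/"><br>
    <section>Press</section><br>
    <subsearch>false</subsearch><br>
  </directory><br>
</directories><br>
<br>
A block like this would take the place of the single location="/" entry; the other settings stay as they are.<br>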
</font><br>
<hr>
<font size=2>---<br>
Incoming mail is certified Virus Free.<br>
Checked by AVG anti-virus system
(<a href="http://www.grisoft.com/" eudora="autourl">http://www.grisoft.com</a>).<br>
Version: 6.0.708 / Virus Database: 464 - Release Date: 18/06/2004<br>
</font><br>
<font size=2>---<br>
Outgoing mail is certified Virus Free.<br>
Checked by AVG anti-virus system
(<a href="http://www.grisoft.com/" eudora="autourl">http://www.grisoft.com</a>).<br>
Version: 6.0.708 / Virus Database: 464 - Release Date: 18/06/2004<br>
</font><br>
</blockquote></body>
</html>