AW: AW: [opencms-dev] Search engine / pure Java

Thomas Maerz thomasmaerz at gmx.de
Tue Jan 14 16:14:32 CET 2003


Hello Werner,

just to point some things out:

. the data shouldn't be public (no web crawcler)
  - any web crawler will/_should_ fail, because he won't gain any access
    to data; all data isn't to be exported statically.
. all data can (no must) have different rights
  - so every user _can_ have its own search index
. group indexes don't mask non-group users

. How to create the/any index?

>> But then there's still the problem of creating the index! I wasn't able
>> to export the data statically to files to create an index, because I get
>> database access errors and I _cannot_ export any file which has any
>> access restrictions. Maybe rewrite the export classes, but I don't like
>> to access any core code of OpenCms.
>>
>> I already had to update my own navigation class (something about
>> "LinkSubstitution.*"); but copy & and paste aren't always sufficient.

> you might consider to index through a crawler from the web rather then from
> your server's file system. Have a a look at LARM's pre-releases, which are
> contained in jakarta'a CVS under jakarta-lucene-sandbox - this might do the
> job - I haven't yet had the chance to eval it and I'd love to read about
> someone's experiences.

I didn't look there, yet, but I appreciate _any_ discussion on this
theme, as this seems for _me_ almost as important as a multi-site
OpenCms. I look at the software!

Regards,
Thomas



More information about the opencms-dev mailing list