[opencms-dev] Indexing files outside the VFS

Siegfried Puchbauer siegfried.puchbauer at gmail.com
Mon Jan 9 15:08:21 CET 2006


In your case, I would create a native lucene index. Just setting up a
seperate search server which indexes your rfs documents.

have a look at: http://lucene.apache.org/

hth

2006/1/9, cannot say <pikica3342 at yahoo.com>:
>
> Hi.
>     I have a large library of documents (in the order of many gigabytes)
> that I am trying to make searchable using the OpenCms full-text search. I
> understand that in order to index a file you must upload it into the VFS
> under a path that is listed as source in the search configuration file.
> Uploading (and waiting for the applet to zip files before upload) is a bit
> impractical, mainly because it's time consuming, this is the small problem
> though.
>     The bigger problem is that I would have to upload every document i
> want searchable and thus have duplicates of each document (one in the VFS
> and one in the real FS) and this would take too much disk space. So my
> question is, "Is there a way to index documents on the real file-system with
> OpenCms?" Or does anyone have a way of making a web accessable search of a
> document library.
>     Further more, the library of documents is ever changing, constantly
> some things are added and some removed. I so far understand that you must
> remake the whole index when you change the contents of its' sources. This is
> impractical as remaking the index of a document library of this size takes a
> long time, is there a way, and if so how would i make the new documents i
> add searchable without remaking the whole index in the lengthy process ?
>
> Help is much appreciated :)
>
> ------------------------------
> Yahoo! Photos
> Got holiday prints? See all the ways<http://us.rd.yahoo.com/mail_us/taglines/holidayprints/*http://pa.yahoo.com/*http://us.rd.yahoo.com/mail_us/taglines/photos/evt=38089/*http://pg.photos.yahoo.com/ph//print_splash>to get quality prints in your hands ASAP.
>
>
>
> _______________________________________________
> This mail is sent to you from the opencms-dev mailing list
> To change your list options, or to unsubscribe from the list, please visit
> http://lists.opencms.org/mailman/listinfo/opencms-dev
>
>


--
Mit freundlichen Grüßen

Siegfried Puchbauer
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://webmail.opencms.org/pipermail/opencms-dev/attachments/20060109/0c1641b5/attachment.htm>


More information about the opencms-dev mailing list