[opencms-dev] Search into OpenCmsVfsFile

Julien Lechevanton j.lechevanton at eurelis.com
Fri Mar 21 14:25:29 CET 2008


Hi,

Thanks for those informations, that's the solution we are trying to implement by now.

In fact, you're right, we don't need to indexthe homepage containing the news summary but that was an example. We are now implementing a portlet based template tool and in that way, considering that portlets are OpenCmsVfsFiles, we need to index them.

Thanks for your help,

J.

-----Message d'origine-----
De : opencms-dev-bounces at opencms.org [mailto:opencms-dev-bounces at opencms.org] De la part de Claus Priisholm
Envoyé : vendredi 21 mars 2008 10:48
À : The OpenCms mailing list
Objet : Re: [opencms-dev] Search into OpenCmsVfsFile

The default indexing concerns itself only with the contents of the files 
as they appear in VFS, it does not index the generated HTML. This is 
actually a feature as it lessens the amount of noise in the index. In 
case where your template includes the content from a different file (or 
files), the summary page itself is seen as being empty (as in not having 
any contents inside the control file) - this also applies to JSPs (which 
are not included in the indexing as per default setup, and even if they 
were, only the JSP code is indexed, not the output generated by the code).

Generally speaking in terms of indexing/searching, I would consider a 
summary page to be pure noise since it does not contain anything that is 
not available on the individual pages summarized in the page. Hence it 
is better for the user to find the page with the contents rather than 
the summary page with only a summary.

But of course, there may be exceptions to that rule, and to be able to 
deal with that I would suggest that you sub-class 
org.opencms.search.documents.CmsDocumentXmlContent and get it to extract 
the necessary contents from the file(s) that makes up the summary.

/Claus

Julien Lechevanton wrote:
> Hi,
> 
> Nobody is experiencing this problem ?
> 
> Regards,
> J
> 
> -----Message d'origine-----
> De : opencms-dev-bounces at opencms.org [mailto:opencms-dev-bounces at opencms.org] De la part de Julien Lechevanton
> Envoyé : mardi 18 mars 2008 14:55
> À : The OpenCms mailing list
> Objet : [opencms-dev] Search into OpenCmsVfsFile
> 
> Hi,
> 
> We are trying to set up the search engine and we are not able to index
> all contents we need. Here is an example : 
> - we have got news summary on our homepage (a lit included through an
> OpenCmsVfsFile)
> - we have got news detail on pages (custom xsd)
> 
> The search engine will only return the page and not the homepage
> containing the news summary (OpenCmsVfsFile).
> 
> Is there a way to make the search engine index whole content including
> the OpenCms** items ?
> 
> Thanks
> 
> J.
> 
> _______________________________________________
> This mail is sent to you from the opencms-dev mailing list
> To change your list options, or to unsubscribe from the list, please visit
> http://lists.opencms.org/mailman/listinfo/opencms-dev
> 
> _______________________________________________
> This mail is sent to you from the opencms-dev mailing list
> To change your list options, or to unsubscribe from the list, please visit
> http://lists.opencms.org/mailman/listinfo/opencms-dev
> 


_______________________________________________
This mail is sent to you from the opencms-dev mailing list
To change your list options, or to unsubscribe from the list, please visit
http://lists.opencms.org/mailman/listinfo/opencms-dev



More information about the opencms-dev mailing list