[opencms-dev] Solr indexing of custom content type
Daniel Seidel
d.seidel at alkacon.com
Tue Nov 27 09:31:17 CET 2018
Hi Thorsten,
the JSP-Search config is multi-lingual, hence you can have a separate
configuration for each locale. And it's good if you restrict the fields
to search in appropriately. Nevertheless, it should have searched in
"text" by default which should contain a copy of "content" which itself
should have "content_de" as part of it - hence I don't understand the
original problem still (assuming a default configuration) - it should
have worked directly.
Best, Daniel.
Am 27.11.18 um 09:24 schrieb Thorsten Duhn:
>
> Hello,
>
> thanks a lot for your patience.
>
> The “q=content_de:…” did it finally. In JSP-Search config a query
> modifier “content_de:%(query)” is needed.
>
> Which in general leaves the question how to do this on multilanguage
> sites, but for now that’s not my problem.
>
> Nevertheless I’m curious. Docs read like “Even with no configuration
> at all, you have a, thought simple, full text search” (and Apollo
> template has a quite more difficult search configuration I don’t
> understand).
>
> https://documentation.opencms.org/opencms-documentation/more-opencms-features/solr-search-integration/full-text-search-with-solr/
> <https://documentation.opencms.org/opencms-documentation/more-opencms-features/solr-search-integration/full-text-search-with-solr/index.html>
>
>
> Regards,
>
> Thorsten
>
> --
>
> B.Sc. Thorsten Duhn
>
> regio gmbh - Institut für Regionalentwicklung und Informationssysteme
>
> Uhlhornsweg 99a, D-26129 Oldenburg
>
> Fon +49 (0)441 798-2818
>
> Fax +49 (0)441 798-3002
>
> Mail duhn at regio-gmbh.de
>
> Web http://www.regio-gmbh.de
>
> Geschäftsführer: Bernd Kramer, Dr. Uwe Kröcher
>
> Amtsgericht Oldenburg (HRB 3903), USt-IdNr. DE187010339
>
> *Von:*opencms-dev-bounces at opencms.org
> [mailto:opencms-dev-bounces at opencms.org] *Im Auftrag von *Daniel Seidel
> *Gesendet:* Montag, 26. November 2018 11:23
> *An:* opencms-dev at opencms.org
> *Betreff:* Re: [opencms-dev] Solr indexing of custom content type
>
> Hi Torsten,
>
> from your files, it looks like the contents are indexed correctly -
> the content_de of the containerpage contains "Lorem ipsum" as well. So
> maybe your query is not correct? Try, e.g., with
> q=content_de:Lorem&fq=con_locales:de&fq=type:containerpage to just
> find the container pages.
>
> Best Daniel.
>
> Am 22.11.18 um 11:08 schrieb Thorsten Duhn:
>
> Hello,
>
> not quite sure this is what you mean (I guess sent the first one
> already), but here they are:
>
> This is the response for the search I’m trying to implement in
> “handleSolrSelect-jspsearch.xml” (note, containerPageOnly is true
> for “custom-content” and new “custom-flexible”, false for
> “custom-news” which has detail page).
>
> /handleSolrSelect?q=lorem&fq=type:custom-*&fq=con_locales:*&fq=parent-folders:%22/sites/default/%22
>
> And this is for the page that is not found in
> “handleSolrSelect-containerpage.xml” containing “custom-flexible”
> content.
>
> /handleSolrSelect?q=*&fq=parent-folders:%22/sites/default/projekte/dummy/%22
>
> Regards,
> Thorsten
>
> --
>
> B.Sc. Thorsten Duhn
>
> regio gmbh - Institut für Regionalentwicklung und Informationssysteme
>
> Uhlhornsweg 99a, D-26129 Oldenburg
>
> Fon +49 (0)441 798-2818
>
> Fax +49 (0)441 798-3002
>
> Mail duhn at regio-gmbh.de <mailto:duhn at regio-gmbh.de>
>
> Web http://www.regio-gmbh.de <http://www.regio-gmbh.de>
>
> Geschäftsführer: Bernd Kramer, Dr. Uwe Kröcher
>
> Amtsgericht Oldenburg (HRB 3903), USt-IdNr. DE187010339
>
> *Von:*opencms-dev-bounces at opencms.org
> <mailto:opencms-dev-bounces at opencms.org>
> [mailto:opencms-dev-bounces at opencms.org] *Im Auftrag von *Daniel
> Seidel
> *Gesendet:* Donnerstag, 22. November 2018 08:37
> *An:* opencms-dev at opencms.org <mailto:opencms-dev at opencms.org>
> *Betreff:* Re: [opencms-dev] Solr indexing of custom content type
>
> Hi Thorsten,
>
> please send the Solr-Handler-Response for a) the content b) the
> container-page the content is on. You can search like
> q=*:*&fq=path:"/the/root/path/to/the/resource".
>
> Best, Daniel.
>
> Am 22.11.18 um 08:35 schrieb Thorsten Duhn:
>
> Just to add, it’s not about nested schemas, a simpler
> “flexible” content has the same problem for me: I cannot find
> anything via its container page. So it seems obvious that
> there is a problem on my side, as of course there are several
> working usages, even OC doc site. But I guess, I double
> checked and tested mostly everything I can imagine.
>
> Regards
>
> --
>
> B.Sc. Thorsten Duhn
>
> regio gmbh - Institut für Regionalentwicklung und
> Informationssysteme
>
> Uhlhornsweg 99a, D-26129 Oldenburg
>
> Fon +49 (0)441 798-2818
>
> Fax +49 (0)441 798-3002
>
> Mail duhn at regio-gmbh.de <mailto:duhn at regio-gmbh.de>
>
> Web http://www.regio-gmbh.de <http://www.regio-gmbh.de>
>
> Geschäftsführer: Bernd Kramer, Dr. Uwe Kröcher
>
> Amtsgericht Oldenburg (HRB 3903), USt-IdNr. DE187010339
>
> *Von:*opencms-dev-bounces at opencms.org
> <mailto:opencms-dev-bounces at opencms.org>
> [mailto:opencms-dev-bounces at opencms.org] *Im Auftrag von
> *Thorsten Duhn
> *Gesendet:* Mittwoch, 21. November 2018 09:22
> *An:* The OpenCms mailing list
> *Betreff:* Re: [opencms-dev] Solr indexing of custom content type
>
> Hello,
>
> no, all this does not change anything. "locale-available"
> already was set correctly to “de”, and I cannot spot any
> difference or problem remaining in formatter config. Attached
> it, but tried a lot of possible settings in it with no
> success. The only game change is containerPageOnly in schema,
> but that is useless for site search where I need to find the
> container page containing this content.
>
> Sure this really works? I remember there were also problems on
> this before <cms:search> was introduced. And with OC10 not
> only <cms:search> was new, also Apollo template came with a
> simpler „Flexible“ content type, with no nested content. But I
> do not find indication in docs, that nested structure is not
> valid anymore.
>
> I’m quite helpless for now. Unfortunately official Alkacon
> support only offers packages of ten incidents, which is way
> too much for our needs.
>
> I guess I will now try to create a new content type – based on
> Apollo approach, not nested. But this will break upgrade path
> for some of our sites, where site search would be necessary as
> well.
>
> Any further ideas?
>
> Regards,
>
> Thorsten
>
> --
>
> B.Sc. Thorsten Duhn
>
> regio gmbh - Institut für Regionalentwicklung und
> Informationssysteme
>
> Uhlhornsweg 99a, D-26129 Oldenburg
>
> Fon +49 (0)441 798-2818
>
> Fax +49 (0)441 798-3002
>
> Mail duhn at regio-gmbh.de <mailto:duhn at regio-gmbh.de>
>
> Web http://www.regio-gmbh.de <http://www.regio-gmbh.de>
>
> Geschäftsführer: Bernd Kramer, Dr. Uwe Kröcher
>
> Amtsgericht Oldenburg (HRB 3903), USt-IdNr. DE187010339
>
> *Von:*opencms-dev-bounces at opencms.org
> <mailto:opencms-dev-bounces at opencms.org>
> [mailto:opencms-dev-bounces at opencms.org] *Im Auftrag von
> *Daniel Seidel
> *Gesendet:* Dienstag, 20. November 2018 17:14
> *An:* opencms-dev at opencms.org <mailto:opencms-dev at opencms.org>
> *Betreff:* Re: [opencms-dev] Solr indexing of custom content type
>
> Hi Thorsten,
>
> looks like your contents by themself are indexed correctly.
> You may again have a look at the formatter-configuration xml
> file, of the formatter you drop your content to the container
> pages and you may find out if you find containerpages at all
> and you could try to set the "locale-available" property at
> the site root to "de" if you use only "de" - and then re-index
> a page you want to find.
>
> Best, Daniel.
>
> Am 20.11.18 um 15:57 schrieb Thorsten Duhn:
>
> Hello,
>
> I’m not sure about posting files in here, but also the
> relevant XSD here. Anything wrong with searchsettings?
>
> Regards,
> Thorsten
>
> --
>
> B.Sc. Thorsten Duhn
>
> regio gmbh - Institut für Regionalentwicklung und
> Informationssysteme
>
> Uhlhornsweg 99a, D-26129 Oldenburg
>
> Fon +49 (0)441 798-2818
>
> Fax +49 (0)441 798-3002
>
> Mail duhn at regio-gmbh.de <mailto:duhn at regio-gmbh.de>
>
> Web http://www.regio-gmbh.de <http://www.regio-gmbh.de>
>
> Geschäftsführer: Bernd Kramer, Dr. Uwe Kröcher
>
> Amtsgericht Oldenburg (HRB 3903), USt-IdNr. DE187010339
>
> *Von:*opencms-dev-bounces at opencms.org
> <mailto:opencms-dev-bounces at opencms.org>
> [mailto:opencms-dev-bounces at opencms.org] *Im Auftrag von
> *Daniel Seidel
> *Gesendet:* Dienstag, 20. November 2018 15:01
> *An:* opencms-dev at opencms.org <mailto:opencms-dev at opencms.org>
> *Betreff:* Re: [opencms-dev] Solr indexing of custom
> content type
>
> Hi Thorsten,
>
> when you look at the search result for the content itself,
> what's in the fields "content_de"/"content_en" ? This is
> what should be added to the container page.
>
> Best, Daniel.
>
> Am 20.11.18 um 14:53 schrieb Thorsten Duhn:
>
> Hello again,
>
> no, I wasn’t right, it is not solved I should have
> taken a look more carefully.
>
> What changed the situation in my test was the change
> to containerpageonly="false". With that my
> custom-content elements were found. But of course the
> link generated is to its XML file, not to the
> containing container page. Changing back returns me to
> the given problem, I get one result where there should
> be two, only the news detail page.
>
> What is found is text contained in page titles. But
> not its content. Not even contents title (which
> obviously isn’t part of the nested types).
>
> Quite bizarre.
>
> Regards,
>
> Thorsten
>
> --
>
> B.Sc. Thorsten Duhn
>
> regio gmbh - Institut für Regionalentwicklung und
> Informationssysteme
>
> Uhlhornsweg 99a, D-26129 Oldenburg
>
> Fon +49 (0)441 798-2818
>
> Fax +49 (0)441 798-3002
>
> Mail duhn at regio-gmbh.de <mailto:duhn at regio-gmbh.de>
>
> Web http://www.regio-gmbh.de <http://www.regio-gmbh.de>
>
> Geschäftsführer: Bernd Kramer, Dr. Uwe Kröcher
>
> Amtsgericht Oldenburg (HRB 3903), USt-IdNr. DE187010339
>
> *Von:*opencms-dev-bounces at opencms.org
> <mailto:opencms-dev-bounces at opencms.org>
> [mailto:opencms-dev-bounces at opencms.org] *Im Auftrag
> von *Thorsten Duhn
> *Gesendet:* Dienstag, 20. November 2018 10:29
> *An:* The OpenCms mailing list
> *Betreff:* Re: [opencms-dev] Solr indexing of custom
> content type
>
> Hello,
>
> wow, thanks a lot, now it works. I guess removing
> search settings from nested XSD did the trick. So I
> did too much before.
>
> Thanks,
> Thorsten
>
> --
>
> B.Sc. Thorsten Duhn
>
> regio gmbh - Institut für Regionalentwicklung und
> Informationssysteme
>
> Uhlhornsweg 99a, D-26129 Oldenburg
>
> Fon +49 (0)441 798-2818
>
> Fax +49 (0)441 798-3002
>
> Mail duhn at regio-gmbh.de <mailto:duhn at regio-gmbh.de>
>
> Web http://www.regio-gmbh.de <http://www.regio-gmbh.de>
>
> Geschäftsführer: Bernd Kramer, Dr. Uwe Kröcher
>
> Amtsgericht Oldenburg (HRB 3903), USt-IdNr. DE187010339
>
> *Von:*opencms-dev-bounces at opencms.org
> <mailto:opencms-dev-bounces at opencms.org>
> [mailto:opencms-dev-bounces at opencms.org] *Im Auftrag
> von *Daniel Seidel
> *Gesendet:* Dienstag, 20. November 2018 09:59
> *An:* opencms-dev at opencms.org
> <mailto:opencms-dev at opencms.org>
> *Betreff:* Re: [opencms-dev] Solr indexing of custom
> content type
>
> Hi Thorsten,
>
> for debugging, I would make the custom-content
> findable by itself (searchsettings
> containerpageonly="false") than you only need to touch
> one of these contents (maybe with rewrite content, I
> always forget about that) and then use the solr
> handler to see how it is indexed. Moreover, check,
> where the searchsettings are done. I think in 10.5
> it's necessary to have them in the main XSD.
>
> Best, Daniel.
>
> Am 20.11.18 um 09:49 schrieb Thorsten Duhn:
>
> Hello,
>
> sorry forgot to mention (so many different places
> involved), of course that is/always was set true.
>
> Regards,
>
> Thorsten
>
> --
>
> B.Sc. Thorsten Duhn
>
> regio gmbh - Institut für Regionalentwicklung und
> Informationssysteme
>
> Uhlhornsweg 99a, D-26129 Oldenburg
>
> Fon +49 (0)441 798-2818
>
> Fax +49 (0)441 798-3002
>
> Mail duhn at regio-gmbh.de <mailto:duhn at regio-gmbh.de>
>
> Web http://www.regio-gmbh.de
> <http://www.regio-gmbh.de>
>
> Geschäftsführer: Bernd Kramer, Dr. Uwe Kröcher
>
> Amtsgericht Oldenburg (HRB 3903), USt-IdNr.
> DE187010339
>
> *Von:*opencms-dev-bounces at opencms.org
> <mailto:opencms-dev-bounces at opencms.org>
> [mailto:opencms-dev-bounces at opencms.org] *Im
> Auftrag von *Daniel Seidel
> *Gesendet:* Dienstag, 20. November 2018 09:47
> *An:* opencms-dev at opencms.org
> <mailto:opencms-dev at opencms.org>
> *Betreff:* Re: [opencms-dev] Solr indexing of
> custom content type
>
> Hi Thorsten,
>
> you should look at the formatter configuration of
> your content. There's a boolean option, you have
> to check if you want to index your content with
> the page - i think it's called "Search content".
>
> Best, Daniel.
>
> Am 20.11.18 um 09:27 schrieb Thorsten Duhn:
>
> Hello,
>
> I have a "custom-content" resource type, a bit
> like a evolution of what was "Flexible" in
> OC8/OC9, with nested choice of HTML, code,
> image and alike. Also I have a “custom-news”
> type with no nested content. Having put Lorem
> ipsum content on my site, a container page
> with “custom-content” and one item for
> “custom-news” list/detail. But when I try to
> get <cms:search> running I only find
> “custom-news” detail pages, not container
> pages having “custom-content”.
>
> /handleSolrSelect?q=ipsum&fq=type:custom-*+OR+type:containerpage&fq=parent-folders:%22/sites/default/%22
>
> I don’t know where else to look at. All
> schemas have <searchsettings> (for all
> elements, “searchcontent” true and false),
> mainly as I see it in OC9 “flexible.xsd”. I
> reindexed on- and offline indexes several
> times, modified content elements and saved,
> restarted Tomcat, but always find only one of
> two elements.
>
> How can I find content from cointainerpages?
> Where can I find indications of what goes wrong?
>
> System is OC 10.5.4, content is German (and
> “fq” of above request contains “con_locales:de”).
>
> Regards,
>
> Thorsten
>
> --
>
> B.Sc. Thorsten Duhn
>
> regio gmbh - Institut für Regionalentwicklung
> und Informationssysteme
>
> Uhlhornsweg 99a, D-26129 Oldenburg
>
> Fon +49 (0)441 798-2818
>
> Fax +49 (0)441 798-3002
>
> Mail duhn at regio-gmbh.de
> <mailto:duhn at regio-gmbh.de>
>
> Web http://www.regio-gmbh.de
> <http://www.regio-gmbh.de>
>
> Geschäftsführer: Bernd Kramer, Dr. Uwe Kröcher
>
> Amtsgericht Oldenburg (HRB 3903), USt-IdNr.
> DE187010339
>
>
>
>
>
>
>
>
> _______________________________________________
>
> This mail is sent to you from the opencms-dev mailing list
>
> To change your list options, or to unsubscribe from the list, please visit
>
> http://lists.opencms.org/cgi-bin/mailman/listinfo/opencms-dev
>
>
>
>
>
>
>
> --
>
> Kind Regards,
>
> Daniel.
>
>
>
> -------------------
>
>
>
> Daniel Seidel
>
>
>
> Alkacon Software GmbH & Co. KG - The OpenCms Experts
>
> http://www.alkacon.com -http://www.opencms.org
>
>
>
>
>
>
>
> _______________________________________________
>
> This mail is sent to you from the opencms-dev mailing list
>
> To change your list options, or to unsubscribe from the list, please visit
>
> http://lists.opencms.org/cgi-bin/mailman/listinfo/opencms-dev
>
>
>
>
>
>
>
> --
>
> Kind Regards,
>
> Daniel.
>
>
>
> -------------------
>
>
>
> Daniel Seidel
>
>
>
> Alkacon Software GmbH & Co. KG - The OpenCms Experts
>
> http://www.alkacon.com -http://www.opencms.org
>
>
>
>
>
> _______________________________________________
>
> This mail is sent to you from the opencms-dev mailing list
>
> To change your list options, or to unsubscribe from the list, please visit
>
> http://lists.opencms.org/cgi-bin/mailman/listinfo/opencms-dev
>
>
>
>
>
>
>
> --
>
> Kind Regards,
>
> Daniel.
>
>
>
> -------------------
>
>
>
> Daniel Seidel
>
>
>
> Alkacon Software GmbH & Co. KG - The OpenCms Experts
>
> http://www.alkacon.com -http://www.opencms.org
>
> _______________________________________________
>
> This mail is sent to you from the opencms-dev mailing list
>
> To change your list options, or to unsubscribe from the list, please visit
>
> http://lists.opencms.org/cgi-bin/mailman/listinfo/opencms-dev
>
>
>
>
>
>
>
> --
>
> Kind Regards,
>
> Daniel.
>
>
>
> -------------------
>
>
>
> Daniel Seidel
>
>
>
> Alkacon Software GmbH & Co. KG - The OpenCms Experts
>
> http://www.alkacon.com -http://www.opencms.org
>
>
>
>
> _______________________________________________
>
> This mail is sent to you from the opencms-dev mailing list
>
> To change your list options, or to unsubscribe from the list, please visit
>
> http://lists.opencms.org/cgi-bin/mailman/listinfo/opencms-dev
>
>
>
>
>
>
>
> --
>
> Kind Regards,
>
> Daniel.
>
>
>
> -------------------
>
>
>
> Daniel Seidel
>
>
>
> Alkacon Software GmbH & Co. KG - The OpenCms Experts
>
> http://www.alkacon.com -http://www.opencms.org
>
>
>
> _______________________________________________
>
> This mail is sent to you from the opencms-dev mailing list
>
> To change your list options, or to unsubscribe from the list, please visit
>
> http://lists.opencms.org/cgi-bin/mailman/listinfo/opencms-dev
>
> --
> Kind Regards,
> Daniel.
>
> -------------------
> Daniel Seidel
> Alkacon Software GmbH & Co. KG - The OpenCms Experts
> http://www.alkacon.com -http://www.opencms.org
>
> _______________________________________________
> This mail is sent to you from the opencms-dev mailing list
> To change your list options, or to unsubscribe from the list, please visit
> http://lists.opencms.org/cgi-bin/mailman/listinfo/opencms-dev
>
>
>
--
Kind Regards,
Daniel.
-------------------
Daniel Seidel
Alkacon Software GmbH & Co. KG - The OpenCms Experts
http://www.alkacon.com - http://www.opencms.org
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://webmail.opencms.org/pipermail/opencms-dev/attachments/20181127/cfaf6d89/attachment.htm>
More information about the opencms-dev
mailing list