Hi all,<br><br>I am trying to index documents in Portuguese. The indexing (mostly of PDF files) goes ok. The index configuration is as follows:<br><br><indexsource><br> <name>source1</name><br>
<indexer class="org.opencms.search.CmsVfsIndexer"/><br> <resources><br> <resource>/sites/default/</resource><br> </resources><br>
<documenttypes-indexed><br> <name>xmlpage</name><br> <name>xmlcontent</name><br> <name>text</name><br> <name>pdf</name><br>
<name>rtf</name><br> <name>html</name><br> <name>msword</name><br> <name>msexcel</name><br> <name>mspowerpoint</name><br>
<name>image</name><br> <name>generic</name><br> </documenttypes-indexed><br> </indexsource><br><br><fieldconfiguration><br>
<name>standard</name><br> <description>The standard OpenCms 7.0 search index field configuration.</description><br> <fields><br> <field name="content" display="%(key.field.content)" store="true" index="true" excerpt="true"><br>
<mapping type="content"/><br> </field><br> <field name="title-key" display="-" store="true" index="untokenized" boost="0.0"><br>
<mapping type="property">Title</mapping><br> </field><br> <field name="title" display="%(key.field.title)" store="false" index="true"><br>
<mapping type="property">Title</mapping><br> </field><br> <field name="keywords" display="%(key.field.keywords)" store="true" index="true"><br>
<mapping type="property">Keywords</mapping><br> </field><br> <field name="description" display="%(key.field.description)" store="true" index="true"><br>
<mapping type="property">Description</mapping><br> </field><br> <field name="meta" display="%(key.field.meta)" store="false" index="true"><br>
<mapping type="property">Title</mapping><br> <mapping type="property">Keywords</mapping><br> <mapping type="property">Description</mapping><br>
</field><br> </fields><br> </fieldconfiguration><br><br> <analyzer><br> <class>org.apache.lucene.analysis.snowball.SnowballAnalyzer</class><br>
<stemmer>Portuguese</stemmer><br> <locale>pt</locale><br> </analyzer><br><br>But when I try to search for a sentence within quotation marks that has stop words, lets say "teste de programa", it shows no result! Of course, if I open the PDF and search for the sentence within, it appears...<br>
<br>I can only thought of a real basic (and strange) solution: to create a new Analyzer without the stop words (and possibly without stemming). This way, every search within quotation marks would bring me the "correct" results, at the cost of a very much greater index.<br>
<br>Any clues?<br><br>TIA,<br>