[opencms-dev] How to configure lucene to support chinese charin6A3?

??? shiys at langhua.cn
Mon Feb 7 14:58:29 CET 2005


Yes, the English analyser is perfect for both chinese charactor sets if you
use utf-8 as the encoding. Please see the attached ui.

Regards,

Shi Yusen/Beijing Langhua Ltd.

-----????-----
???: Benjamin Rasmussen [mailto:benjamin.rasmussen at ctxm.com]
????: 2005?2?7? 16:19
???: 'The OpenCms mailing list'; olli_aro at yahoo.co.uk
??: RE: ´đ¸´: [opencms-dev] How to configure lucene to support chinese
charin6A3?



Can you use the English analyser for both simplified and traditional
Chinese?


-----Original Message-----
From: opencms-dev-bounces at opencms.org
[mailto:opencms-dev-bounces at opencms.org] On Behalf Of ???
Sent: Thursday, February 03, 2005 19:11 PM
To: olli_aro at yahoo.co.uk; The OpenCms mailing list
Subject: ´đ¸´: [opencms-dev] How to configure lucene to support chinese
charin6A3?

Actually you do not need extra analyser for chinese charactors, the standard
analyser for english is perfect.

Lucene can make index for chinese charactors by parsing each charactor as
one English word. So what you should do is to separate your input string
with space between chinese charactors and then search the new string.

Regards,

Shi Yusen/Beijing Langhua Ltd.


-----????-----
???: Olli Aro [mailto:olli_aro at yahoo.co.uk]
????: 2005?2?3? 15:57
???: 'The OpenCms mailing list'
??: RE: [opencms-dev] How to configure lucene to support chinese charin6A3?


By default Lucene only has analyser for English and German (and this is
probably the case for OpenCMS Lucene part as well). In order to get other
languages working you need to create / install appropriate analyser. You can
find more information on Analysers from the Lucene site at
http://jakarta.apache.org/lucene/.

Regards,

Olli

P.S. Of course it might be that some of the Chinese OpenCMS users on this
list already has the analyser, which case they might be interested to
contribute it to OpenCMS?




From: opencms-dev-bounces at opencms.org
[mailto:opencms-dev-bounces at opencms.org] On Behalf Of joshwa
Sent: 03 February 2005 07:11
To: opencms-dev at opencms.org
Subject: [opencms-dev] How to configure lucene to support chinese char
in6A3?




--
No virus found in this outgoing message.
Checked by AVG Anti-Virus.
Version: 7.0.300 / Virus Database: 265.8.3 - Release Date: 31/01/2005



-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://webmail.opencms.org/pipermail/opencms-dev/attachments/20050207/54c03613/attachment.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: utf8-search.jpg
Type: application/octet-stream
Size: 41411 bytes
Desc: not available
URL: <https://webmail.opencms.org/pipermail/opencms-dev/attachments/20050207/54c03613/attachment.obj>


More information about the opencms-dev mailing list