<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD><TITLE></TITLE>
<META http-equiv=Content-Type content="text/html; charset=iso-8859-2">
<META content="MSHTML 6.00.2800.1479" name=GENERATOR></HEAD>
<BODY>
<P><FONT size=2>Yes, the English analyser is perfect for both chinese charactor
sets if you use utf-8 as the encoding. Please see the attached
ui.<BR><BR>Regards,<BR><BR>Shi Yusen/Beijing Langhua
Ltd.<BR><BR>-----????-----<BR>???: Benjamin Rasmussen [<A
href="mailto:benjamin.rasmussen@ctxm.com">mailto:benjamin.rasmussen@ctxm.com</A>]<BR>????:
2005?2?7? 16:19<BR>???: 'The OpenCms mailing list'; olli_aro@yahoo.co.uk<BR>??:
RE: ΄πΈ΄: [opencms-dev] How to configure lucene to support chinese
charin6A3?<BR><BR><BR><BR>Can you use the English analyser for both simplified
and traditional<BR>Chinese?<BR><BR><BR>-----Original Message-----<BR>From:
opencms-dev-bounces@opencms.org<BR>[<A
href="mailto:opencms-dev-bounces@opencms.org">mailto:opencms-dev-bounces@opencms.org</A>]
On Behalf Of ???<BR>Sent: Thursday, February 03, 2005 19:11 PM<BR>To:
olli_aro@yahoo.co.uk; The OpenCms mailing list<BR>Subject: ΄πΈ΄: [opencms-dev]
How to configure lucene to support chinese<BR>charin6A3?<BR><BR>Actually you do
not need extra analyser for chinese charactors, the standard<BR>analyser for
english is perfect.<BR><BR>Lucene can make index for chinese charactors by
parsing each charactor as<BR>one English word. So what you should do is to
separate your input string<BR>with space between chinese charactors and then
search the new string.<BR><BR>Regards,<BR><BR>Shi Yusen/Beijing Langhua
Ltd.<BR><BR><BR>-----????-----<BR>???: Olli Aro [<A
href="mailto:olli_aro@yahoo.co.uk">mailto:olli_aro@yahoo.co.uk</A>]<BR>????:
2005?2?3? 15:57<BR>???: 'The OpenCms mailing list'<BR>??: RE: [opencms-dev] How
to configure lucene to support chinese charin6A3?<BR><BR><BR>By default Lucene
only has analyser for English and German (and this is<BR>probably the case for
OpenCMS Lucene part as well). In order to get other<BR>languages working you
need to create / install appropriate analyser. You can<BR>find more information
on Analysers from the Lucene site at<BR><A
href="http://jakarta.apache.org/lucene/"
target=_blank>http://jakarta.apache.org/lucene/</A>.<BR><BR>Regards,<BR><BR>Olli<BR><BR>P.S.
Of course it might be that some of the Chinese OpenCMS users on this<BR>list
already has the analyser, which case they might be interested to<BR>contribute
it to OpenCMS?<BR><BR><BR><BR><BR>From: opencms-dev-bounces@opencms.org<BR>[<A
href="mailto:opencms-dev-bounces@opencms.org">mailto:opencms-dev-bounces@opencms.org</A>]
On Behalf Of joshwa<BR>Sent: 03 February 2005 07:11<BR>To:
opencms-dev@opencms.org<BR>Subject: [opencms-dev] How to configure lucene to
support chinese char<BR>in6A3?<BR><BR><BR><BR><BR>--<BR>No virus found in this
outgoing message.<BR>Checked by AVG Anti-Virus.<BR>Version: 7.0.300 / Virus
Database: 265.8.3 - Release Date:
31/01/2005<BR><BR><BR></FONT></P></BODY></HTML>