<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv=Content-Type content="text/html; charset=iso-8859-2">
<meta name=Generator content="Microsoft Word 11 (filtered medium)">
<!--[if !mso]>
<style>
v\:* {behavior:url(#default#VML);}
o\:* {behavior:url(#default#VML);}
w\:* {behavior:url(#default#VML);}
.shape {behavior:url(#default#VML);}
</style>
<![endif]-->
<style>
<!--
/* Font Definitions */
@font-face
{font-family:Tahoma;
panose-1:2 11 6 4 3 5 4 4 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman";}
a:link, span.MsoHyperlink
{color:blue;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{color:blue;
text-decoration:underline;}
p
{mso-margin-top-alt:auto;
margin-right:0in;
mso-margin-bottom-alt:auto;
margin-left:0in;
font-size:12.0pt;
font-family:"Times New Roman";}
span.EmailStyle18
{mso-style-type:personal-reply;
font-family:Arial;
color:navy;}
@page Section1
{size:8.5in 11.0in;
margin:1.0in 1.25in 1.0in 1.25in;}
div.Section1
{page:Section1;}
-->
</style>
</head>
<body lang=EN-US link=blue vlink=blue>
<div class=Section1>
<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'><o:p> </o:p></span></font></p>
<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'>This means that every character has a meaning
on its own? The Chinese characters are not combined to make up a meaning? E.g.
you will never need two characters to make a word.<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'><o:p> </o:p></span></font></p>
<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'>Thanks,<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'><o:p> </o:p></span></font></p>
<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'>Benjamin<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'><o:p> </o:p></span></font></p>
<div>
<div class=MsoNormal align=center style='text-align:center'><font size=3
face="Times New Roman"><span style='font-size:12.0pt'>
<hr size=2 width="100%" align=center tabindex=-1>
</span></font></div>
<p class=MsoNormal><b><font size=2 face=Tahoma><span style='font-size:10.0pt;
font-family:Tahoma;font-weight:bold'>From:</span></font></b><font size=2
face=Tahoma><span style='font-size:10.0pt;font-family:Tahoma'>
opencms-dev-bounces@opencms.org [mailto:opencms-dev-bounces@opencms.org] <b><span
style='font-weight:bold'>On Behalf Of </span></b>???<br>
<b><span style='font-weight:bold'>Sent:</span></b> Monday, February 07, 2005
15:58 PM<br>
<b><span style='font-weight:bold'>To:</span></b> The OpenCms mailing list<br>
<b><span style='font-weight:bold'>Subject:</span></b> re: [opencms-dev] How to
configure lucene to support chinesecharin6A3?</span></font><o:p></o:p></p>
</div>
<p class=MsoNormal><font size=3 face="Times New Roman"><span style='font-size:
12.0pt'><o:p> </o:p></span></font></p>
<p style='margin-bottom:12.0pt'><font size=2 face="Times New Roman"><span
style='font-size:10.0pt'>Yes, the English analyser is perfect for both chinese
charactor sets if you use utf-8 as the encoding. Please see the attached ui.<br>
<br>
Regards,<br>
<br>
Shi Yusen/Beijing Langhua Ltd.<br>
<br>
-----????-----<br>
???: Benjamin Rasmussen [<a href="mailto:benjamin.rasmussen@ctxm.com">mailto:benjamin.rasmussen@ctxm.com</a>]<br>
????: 2005?2?7? 16:19<br>
???: 'The OpenCms mailing list'; olli_aro@yahoo.co.uk<br>
??: RE: ΄πΈ΄: [opencms-dev] How to configure lucene to support chinese
charin6A3?<br>
<br>
<br>
<br>
Can you use the English analyser for both simplified and traditional<br>
Chinese?<br>
<br>
<br>
-----Original Message-----<br>
From: opencms-dev-bounces@opencms.org<br>
[<a href="mailto:opencms-dev-bounces@opencms.org">mailto:opencms-dev-bounces@opencms.org</a>]
On Behalf Of ???<br>
Sent: Thursday, February 03, 2005 19:11 PM<br>
To: olli_aro@yahoo.co.uk; The OpenCms mailing list<br>
Subject: ΄πΈ΄: [opencms-dev] How to configure lucene to support chinese<br>
charin6A3?<br>
<br>
Actually you do not need extra analyser for chinese charactors, the standard<br>
analyser for english is perfect.<br>
<br>
Lucene can make index for chinese charactors by parsing each charactor as<br>
one English word. So what you should do is to separate your input string<br>
with space between chinese charactors and then search the new string.<br>
<br>
Regards,<br>
<br>
Shi Yusen/Beijing Langhua Ltd.<br>
<br>
<br>
-----????-----<br>
???: Olli Aro [<a href="mailto:olli_aro@yahoo.co.uk">mailto:olli_aro@yahoo.co.uk</a>]<br>
????: 2005?2?3? 15:57<br>
???: 'The OpenCms mailing list'<br>
??: RE: [opencms-dev] How to configure lucene to support chinese charin6A3?<br>
<br>
<br>
By default Lucene only has analyser for English and German (and this is<br>
probably the case for OpenCMS Lucene part as well). In order to get other<br>
languages working you need to create / install appropriate analyser. You can<br>
find more information on Analysers from the Lucene site at<br>
<a href="http://jakarta.apache.org/lucene/" target="_blank">http://jakarta.apache.org/lucene/</a>.<br>
<br>
Regards,<br>
<br>
Olli<br>
<br>
P.S. Of course it might be that some of the Chinese OpenCMS users on this<br>
list already has the analyser, which case they might be interested to<br>
contribute it to OpenCMS?<br>
<br>
<br>
<br>
<br>
From: opencms-dev-bounces@opencms.org<br>
[<a href="mailto:opencms-dev-bounces@opencms.org">mailto:opencms-dev-bounces@opencms.org</a>]
On Behalf Of joshwa<br>
Sent: 03 February 2005 07:11<br>
To: opencms-dev@opencms.org<br>
Subject: [opencms-dev] How to configure lucene to support chinese char<br>
in6A3?<br>
<br>
<br>
<br>
<br>
--<br>
No virus found in this outgoing message.<br>
Checked by AVG Anti-Virus.<br>
Version: 7.0.300 / Virus Database: 265.8.3 - Release Date: 31/01/2005<br>
<br>
</span></font><o:p></o:p></p>
</div>
</body>
</html>