<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META http-equiv=Content-Type content="text/html; charset=iso-8859-1">
<META content="MSHTML 6.00.2800.1400" name=GENERATOR>
<STYLE></STYLE>
</HEAD>
<BODY bgColor=#ffffff>
<DIV><FONT face=Arial size=2>Based on the OpenCMS documentation I've read, the
follow is how the language specific data storage is handled in
OpenCMS:</FONT></DIV><FONT face=Arial size=2>
<DIV><BR>1) Content can be in edited / stored in any encoding (e.g. “Big5”,
“ISO8859_5” etc.)<BR>2) Resource is saved in native encoding in the OpenCms VFS
repository<BR>3) While processing dynamic pages, content is converted to UTF-8
(Unicode)<BR>4) Response encoding is set depending on business logic of the
application</DIV>
<DIV> </DIV>
<DIV>This still leaves me with the question of what encoding needs to be used on
the DB to store both Western and Asian languages. Since the data appears
to be stored in it's native encoding, then converted to UTF-8, I'll assume I
need a specific characterset to accommodate all the languages being used (since
it's ultimately converted to UTF-8, I would assume just create the Oracle
DB under the UTF-8 encoding). Then again, IF it's all
stored simply as unicode in the DB as text, I'm not sure I need to create
another instance running under UTF-8.</DIV>
<DIV> </DIV>
<DIV>Thoughts?</DIV>
<DIV> </DIV>
<DIV>-Mark</FONT></DIV>
<BLOCKQUOTE dir=ltr
style="PADDING-RIGHT: 0px; PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #000000 2px solid; MARGIN-RIGHT: 0px">
<DIV style="FONT: 10pt arial">----- Original Message ----- </DIV>
<DIV
style="BACKGROUND: #e4e4e4; FONT: 10pt arial; font-color: black"><B>From:</B>
<A title=mjw22@drexel.edu href="mailto:mjw22@drexel.edu">Mark J. Weidner</A>
</DIV>
<DIV style="FONT: 10pt arial"><B>To:</B> <A title=opencms-dev@opencms.org
href="mailto:opencms-dev@opencms.org">opencms-dev@opencms.org</A> </DIV>
<DIV style="FONT: 10pt arial"><B>Sent:</B> Thursday, July 01, 2004 10:25
AM</DIV>
<DIV style="FONT: 10pt arial"><B>Subject:</B> [opencms-dev] Oracle8i Character
Encoding</DIV>
<DIV><BR></DIV>
<DIV><FONT face=Arial size=2>Hello everyone,</FONT></DIV>
<DIV><FONT face=Arial size=2></FONT> </DIV>
<DIV><FONT face=Arial size=2>I've been using OpenCMS 5.0.1 storing english
content using Oracle8i, and will now be incorporating additional European and
Asian languages. As such, I'm curious as to whether I need to setup a
separate Oracle instance using a specific character encoding (I'll assume
as UTF-8, or perhaps UTF-16), or does the default Oracle encoding
handle this properly with OpenCMS? If anyone has any experience in this
area, please let me know.</FONT></DIV>
<DIV><FONT face=Arial size=2></FONT> </DIV>
<DIV><FONT face=Arial size=2>Thanks!</FONT></DIV>
<DIV><FONT face=Arial size=2></FONT> </DIV>
<DIV><FONT face=Arial size=2>-Mark</FONT></DIV>
<DIV><FONT face=Arial size=2></FONT> </DIV></BLOCKQUOTE></BODY></HTML>