[opencms-dev] Oracle8i Character Encoding

Alexander Kandzior alex at opencms.org
Sat Jul 10 08:19:02 CEST 2004


Storage in the DB is done in 2 ways:
 
1. The OpenCms file content (edited with the WYSIWYG editor). This is stored
as a BLOB, so the DB doesn't care about the encoding. OpenCms reads and
writes this content using the encoding set in opencms.properties.
 
2. The OpenCms properties (e.g. "Title"). OpenCms handles these internally
as Strings, and they are written as such to the DB. So for the properties
the DB encoding matters.
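
A minimal sketch of the difference between the two cases, in Python rather
than OpenCms code. The variable names, the sample title, and the choice of
ISO-8859-1 as the DB character set are assumptions for illustration only:

```python
# Illustration only; none of these names come from OpenCms.
title = "日本語 Title"

# Case 1: content stored as a BLOB. The application picks the encoding
# (standing in for the value set in opencms.properties); the DB only
# ever sees bytes, so the round trip is lossless.
content_encoding = "utf-8"
blob = title.encode(content_encoding)
restored = blob.decode(content_encoding)

# Case 2: a property stored as text in a DB whose character set cannot
# represent the characters (assumed here: ISO-8859-1). Characters that
# do not fit are replaced, so the original value is lost.
db_charset = "iso-8859-1"
stored = title.encode(db_charset, errors="replace")
damaged = stored.decode(db_charset)

print(restored)  # 日本語 Title
print(damaged)   # ??? Title
```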
 
Best Regards,
Alex.

Alexander Kandzior
Alkacon Software - The OpenCms Experts
http://www.alkacon.com

-----Original Message-----
From: opencms-dev-admin at opencms.org [mailto:opencms-dev-admin at opencms.org]
On Behalf Of Mark J. Weidner
Sent: Saturday, July 10, 2004 3:22 AM
To: opencms-dev at opencms.org
Subject: Re: [opencms-dev] Oracle8i Character Encoding


Based on the OpenCMS documentation I've read, the following is how
language-specific data storage is handled in OpenCMS:


1) Content can be edited / stored in any encoding (e.g. "Big5",
"ISO8859_5" etc.)
2) Resource is saved in native encoding in the OpenCms VFS repository
3) While processing dynamic pages, content is converted to UTF-8 (Unicode)
4) Response encoding is set depending on business logic of the application
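
The four steps above can be sketched roughly as follows. This is Python, not
OpenCms code; the sample text and the choice of Big5 and UTF-8 are
assumptions for illustration, and step 3 is modeled as decoding to a Unicode
string:

```python
# Illustration only; the encodings and names below are assumed.
native_encoding = "big5"      # steps 1-2: encoding the resource is stored in
response_encoding = "utf-8"   # step 4: chosen by the application

text = "中文"

# Steps 1-2: the resource is saved as bytes in its native encoding.
stored_bytes = text.encode(native_encoding)

# Step 3: while processing, the bytes are decoded to Unicode.
unicode_text = stored_bytes.decode(native_encoding)

# Step 4: the response is encoded in whatever the application selects.
response_bytes = unicode_text.encode(response_encoding)

print(len(stored_bytes), len(response_bytes))  # 4 6
```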
 
This still leaves me with the question of what encoding needs to be used on
the DB to store both Western and Asian languages.  Since the data appears to
be stored in its native encoding and then converted to UTF-8, I assume I
need a specific character set that accommodates all the languages being used
(since it's ultimately converted to UTF-8, I would assume I should just
create the Oracle DB with the UTF-8 encoding).  Then again, if it's all
simply stored as Unicode text in the DB, I'm not sure I need to create
another instance running under UTF-8.
 
Thoughts?
 
-Mark

----- Original Message ----- 
From: Mark J. Weidner <mailto:mjw22 at drexel.edu>  
To: opencms-dev at opencms.org 
Sent: Thursday, July 01, 2004 10:25 AM
Subject: [opencms-dev] Oracle8i Character Encoding

Hello everyone,
 
I've been using OpenCMS 5.0.1 to store English content in Oracle8i, and
will now be adding more European and Asian languages.  I'm wondering
whether I need to set up a separate Oracle instance using a specific
character encoding (I assume UTF-8, or perhaps UTF-16), or whether the
default Oracle encoding handles this properly with OpenCMS.  If anyone has
any experience in this area, please let me know.
 
Thanks!
 
-Mark
 
