[opencms-dev] problem with umlauts with OpenCMS 6

Scott Carlson scott at optionvue.com
Thu Feb 22 20:22:33 CET 2007


Are you sure they are broken?

The exported files represent the single umlaut-u as two bytes. That is correct. When you view the file, you are apparently viewing it in some other encoding.
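
As a rough illustration of the two-byte point (a minimal Python sketch, not anything OpenCms itself runs): one umlaut becomes two bytes in UTF-8, and reading those bytes with a single-byte encoding turns them into two unrelated characters. ISO-8859-1 as the "wrong" encoding is only an assumption here; your viewer may be using a different one.

```python
# One umlaut character encoded as UTF-8 occupies two bytes.
text = "ü"
utf8_bytes = text.encode("utf-8")
print(len(utf8_bytes), utf8_bytes.hex())

# Decoding the same bytes with the wrong encoding (ISO-8859-1 is assumed
# here) yields two characters instead of one.
misread = utf8_bytes.decode("iso-8859-1")
print(len(misread))

# Decoding with the correct encoding gives the original character back.
roundtrip = utf8_bytes.decode("utf-8")
print(roundtrip == text)
```

The bytes in the file are the same in both cases; only the interpretation changes.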

UTF-8 files commonly have a UTF-8 marker (the byte order mark, or BOM) at the beginning of the file to indicate that the encoding is UTF-8. If you are saying that it is not there and it should be, then THAT could be called a bug.
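
The marker mentioned above is usually called a byte order mark (BOM); for UTF-8 it is the three bytes EF BB BF. A small sketch of how you might check for it in Python follows. The helper name has_utf8_bom is made up for this example.

```python
import codecs


def has_utf8_bom(data: bytes) -> bool:
    """Return True if the data starts with the UTF-8 BOM (EF BB BF)."""
    return data.startswith(codecs.BOM_UTF8)


# Sample data: the same text with and without a leading BOM.
without_bom = "für".encode("utf-8")
with_bom = codecs.BOM_UTF8 + without_bom

print(has_utf8_bom(with_bom), has_utf8_bom(without_bom))

# The "utf-8-sig" codec strips a leading BOM if one is present.
decoded = with_bom.decode("utf-8-sig")
```

For a real exported file you would read the first few bytes in binary mode and pass them to the helper.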

You can verify that the files are encoded correctly by opening them in an editor that allows you to choose the encoding, setting it to UTF-8, and then validating that the characters display properly. You could even drop the files in a browser, set the encoding to UTF-8, and then you will see the characters displayed "properly".
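
If you would rather check the export zip in one pass than open each file by hand, something like the following Python sketch might do. It assumes the files of interest end in .jsp or .html; the function name find_invalid_utf8 and the sample archive are invented for the example. Note that passing a strict UTF-8 decode does not prove the content is what you intended (plain ASCII passes too), it only rules out bytes that cannot be UTF-8.

```python
import io
import zipfile


def find_invalid_utf8(zip_file, suffixes=(".jsp", ".html")):
    """Return names of archive entries that do not decode as strict UTF-8."""
    bad = []
    with zipfile.ZipFile(zip_file) as archive:
        for name in archive.namelist():
            if not name.endswith(suffixes):
                continue  # skip entries we are not interested in
            try:
                archive.read(name).decode("utf-8")
            except UnicodeDecodeError:
                bad.append(name)
    return bad


# Build a small in-memory archive to exercise the check (sample data only).
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as archive:
    archive.writestr("good.jsp", "Grüße".encode("utf-8"))
    archive.writestr("bad.jsp", "Grüße".encode("iso-8859-1"))
buffer.seek(0)

result = find_invalid_utf8(buffer)
print(result)
```

With a real export you would pass the path of the zip file instead of the in-memory buffer.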

Scott

-----Original Message-----
From: opencms-dev-bounces at opencms.org [mailto:opencms-dev-bounces at opencms.org] On Behalf Of Sven Worm
Sent: Thursday, February 22, 2007 11:47 AM
To: The OpenCms mailing list
Subject: Re: [opencms-dev] problem with umlauts with OpenCMS 6

Hi there,
in order to make sure that you understood me correctly: the problem is
not a matter of displaying content. The problem occurs when doing a
file export to a zip file. The exported files are broken. Even the
umlauts within JSP files are shown as their UTF-8
representation.

Here is our configuration:

Apache Tomcat 5.5.17
Sun Java jre1.5.0_07
MySQL 5.0.24
OpenCMS 6.2.1

We did not change any component since upgrading to OpenCMS 6.

You say that a possible solution would be to include a meta tag?

<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />

best regards

Sven



2007/2/22, Christian Steinert <christian_steinert at web.de>:
>
> > Hello Community,
> >
> >
> > I have a problem with umlauts. OpenCMS displays "ä","ü","ö" on the page correctly, whether they are entered into the content via fckeditor or written as code within JSP pages. When we do file exports of certain directories and files, the umlauts "ä","ü","ö" are replaced by characters like À ÃŒ �
>
> What you are seeing is the UTF-8 representation of your characters.
>
> You have to declare your encoding in a meta tag in the header of each page so that the browser knows how to decode your characters:
>     <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
>
>
> While your pages are served by OpenCms, OpenCms knows the page encoding and communicates it to the browser by sending an HTTP header. If your pages are exported and served by some static web server (usually Apache), then that web server cannot tell the client which encoding to use, and therefore your pages need to declare their encoding in the aforementioned meta tag.
>
> hth
> christian
>
> _______________________________________________
> This mail is sent to you from the opencms-dev mailing list
> To change your list options, or to unsubscribe from the list, please visit
> http://lists.opencms.org/mailman/listinfo/opencms-dev
>
