[opencms-dev] umlauts in ISO-8859-15 encoded documents

Christoph P. Kukulies kuku at physik.rwth-aachen.de
Fri Apr 20 15:35:15 CEST 2007


I got a bunch of HTML-pages
"<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">"
"<html>"
"<head>"
  "<meta content="text/html; charset=ISO-8859-15""
 "http-equiv="content-type">"

(Hope, I succeeded in guarding the HTML code against expansion though
mail-readers with the quotes)

The file contains Umlauts, like "für" in this text. The ü is not
represented a HTML character \ü but as 8bit character, hex code
0xfc.

The files all came in a ZIP file. I uploaded that ZIP-file to a folder I
created (a download special folder - hope that doesn't influence things
negatively).

The files appeared as text file.
I created a page for each HTML file and pasted the content into the
"Inhalt" section of my page. 

The result, when I vie the page, is, that all umlaut and othe 8bit
characters appear as question mark surrounded by a black diamond.

Any clues what the best way would be to transport the pages from 
an outside textediting-system (like MS Word) into OpenCms for
publishing?

--
Chris Christoph P. U. Kukulies kukulies (at) rwth-aachen.de




More information about the opencms-dev mailing list