[opencms-dev] umlauts in ISO-8859-15 encoded documents
Christoph P. Kukulies
kuku at physik.rwth-aachen.de
Fri Apr 20 15:35:15 CEST 2007
I got a bunch of HTML-pages
"<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">"
"<html>"
"<head>"
"<meta content="text/html; charset=ISO-8859-15""
"http-equiv="content-type">"
(Hope, I succeeded in guarding the HTML code against expansion though
mail-readers with the quotes)
The file contains Umlauts, like "für" in this text. The ü is not
represented a HTML character \ü but as 8bit character, hex code
0xfc.
The files all came in a ZIP file. I uploaded that ZIP-file to a folder I
created (a download special folder - hope that doesn't influence things
negatively).
The files appeared as text file.
I created a page for each HTML file and pasted the content into the
"Inhalt" section of my page.
The result, when I vie the page, is, that all umlaut and othe 8bit
characters appear as question mark surrounded by a black diamond.
Any clues what the best way would be to transport the pages from
an outside textediting-system (like MS Word) into OpenCms for
publishing?
--
Chris Christoph P. U. Kukulies kukulies (at) rwth-aachen.de
More information about the opencms-dev
mailing list