[jdom-interest] Character encoding from UTF-8 to ISO-8859-1

Nott, Matthias mnott at vignette.com
Wed Feb 6 14:50:33 PST 2002


Well, Oracle has a different opinion about the superset
situation. Try ALTER DATABASE CHARACTER SET UTF8
on a Latin1 database... -> "is not a superset of..."

M

-----Original Message-----
From: Alex Rosen [mailto:arosen at silverstream.com]
Sent: Wednesday, February 06, 2002 10:08 PM
To: 'Nott, Matthias'; 'dumdum 420'; jdom-interest at jdom.org
Subject: RE: [jdom-interest] Character encoding from UTF-8 to ISO-8859-1


> You cannot transcode exactly UTF-8 to Latin1 as UTF-8 has a richer
> set of characters

True - if your document contains chars that aren't in the character encoding
that you're using, such as Czech or Chinese characters if you're using
Latin1, then they need to be escaped as character references, like "&#1234;".
I forget if XMLOutputter does this for you - I think it doesn't, but it's
planned for the future.
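
As a rough illustration of the escaping described above, here is a minimal sketch that uses only the standard JDK, not JDOM. The class name CharRefEscaper and the method escapeUnencodable are hypothetical names made up for this example. It replaces every code point the target charset cannot encode with a decimal character reference. It does not handle markup escaping of characters such as & or <, which would need to happen separately.

```java
import java.nio.charset.Charset;
import java.nio.charset.CharsetEncoder;
import java.nio.charset.StandardCharsets;

public class CharRefEscaper {

    /**
     * Hypothetical helper: returns the text with every code point that the
     * target charset cannot encode replaced by a decimal character reference.
     */
    static String escapeUnencodable(String text, Charset target) {
        CharsetEncoder encoder = target.newEncoder();
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            String s = new String(Character.toChars(cp));
            if (encoder.canEncode(s)) {
                out.append(s);                            // representable as-is
            } else {
                out.append("&#").append(cp).append(';');  // e.g. &#269;
            }
            i += Character.charCount(cp);                 // step over surrogate pairs
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // e-acute and y-acute exist in Latin1; c-caron and the CJK character do not.
        String text = "caf\u00e9 \u010desk\u00fd \u4e2d";
        System.out.println(escapeUnencodable(text, StandardCharsets.ISO_8859_1));
    }
}
```

The escaped string could then be written out with an ISO-8859-1 writer without losing any characters, assuming the XML declaration names the same encoding.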

> and, especially, as UTF-8 is not a superset of Latin1.

No, all of Unicode is supported by UTF-8, and Unicode is a superset of the
Latin1 (ISO-8859-1) character set.
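
The superset claim can be checked with the standard JDK. The sketch below (the class name Latin1Subset and its methods are hypothetical) decodes all 256 Latin1 byte values, sends them through UTF-8, and confirms nothing is lost. It also counts the UTF-8 bytes, which shows that the byte-level encodings differ above 0x7F. That difference is one plausible reading of the Oracle message quoted in this thread: the character repertoire is a superset, but the byte representation is not identical.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

public class Latin1Subset {

    /** Builds an array holding every possible single-byte value, 0x00..0xFF. */
    static byte[] allLatin1Bytes() {
        byte[] bytes = new byte[256];
        for (int i = 0; i < 256; i++) {
            bytes[i] = (byte) i;
        }
        return bytes;
    }

    /** True if every Latin1 character survives Latin1 -> UTF-8 -> Latin1. */
    static boolean roundTrips() {
        byte[] latin1 = allLatin1Bytes();
        String decoded = new String(latin1, StandardCharsets.ISO_8859_1);
        byte[] utf8 = decoded.getBytes(StandardCharsets.UTF_8);
        String back = new String(utf8, StandardCharsets.UTF_8);
        return Arrays.equals(latin1, back.getBytes(StandardCharsets.ISO_8859_1));
    }

    /** Number of UTF-8 bytes needed for the 256 Latin1 characters. */
    static int utf8ByteCount() {
        String decoded = new String(allLatin1Bytes(), StandardCharsets.ISO_8859_1);
        return decoded.getBytes(StandardCharsets.UTF_8).length;
    }

    public static void main(String[] args) {
        System.out.println("round trip ok: " + roundTrips());
        // 128 one-byte characters plus 128 two-byte characters
        System.out.println("UTF-8 bytes:   " + utf8ByteCount());
    }
}
```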

Alex

