[jdom-interest] Character escaping
Malachi de AElfweald
malachi at tremerechantry.com
Mon Mar 17 08:35:20 PST 2003
On Sun, 16 Mar 2003 13:49:54 -0700, Alex Rosen <arosen at novell.com> wrote:
> The second issue is with characters > 16 bits, which I understand only
> partially. (Elliotte you'll have to help me out here.) It seems that
> Java doesn't fully support this now, since there's a JSR to add support
> for them in JDK 1.5. Presumably this support will use surrogate pairs,
> where it takes two Java chars to represent these new Unicode characters.
> But CharsetEncoder in 1.4 seems to take this into account, it talks
> about surrogate pairs. I guess this API was written with the future in
> mind, for when Java does fully support them?
You can convert to UTF-8, in which case it supports Unicode 3. If you want
Unicode 3, however, you have to upgrade to XML 1.1.
Malachi
More information about the jdom-interest
mailing list