[jdom-interest] CDATA inconsistency
Elliotte Rusty Harold
elharo at metalab.unc.edu
Sun Nov 3 04:12:52 PST 2002
At 9:22 PM -0800 11/2/02, Malachi de AElfweald wrote:
>Ok, so the real issue is that the characters are being added as
>binary data instead of as Java chars then?
No. It's not. The problem is that a Java char is not a Unicode
character and thus not an XML character.
>That would mean that
>grabbing the data from the initial source would be the problem?
Not really.
>So, if the original data were grabbed via a BufferedReader via
>an InputStreamReader using the correct encoding, would that not
>make sure that the data had the correct surrogate pairs internally,
>since they would all be valid Java characters?
>
A Reader might fix this, but might not, depending on what the
original encoding was.
--
+-----------------------+------------------------+-------------------+
| Elliotte Rusty Harold | elharo at metalab.unc.edu | Writer/Programmer |
+-----------------------+------------------------+-------------------+
| XML in a Nutshell, 2nd Edition (O'Reilly, 2002) |
| http://www.cafeconleche.org/books/xian2/ |
| http://www.amazon.com/exec/obidos/ISBN%3D0596002920/cafeaulaitA/ |
+----------------------------------+---------------------------------+
| Read Cafe au Lait for Java News: http://www.cafeaulait.org/ |
| Read Cafe con Leche for XML News: http://www.cafeconleche.org/ |
+----------------------------------+---------------------------------+
More information about the jdom-interest
mailing list