[jdom-interest] CDATA inconsistency
bob mcwhirter
bob at werken.com
Sat Nov 2 08:47:38 PST 2002
> At 11:08 PM -0800 11/1/02, Malachi de AElfweald wrote:
> >It would be against XML spec to check the characters within the
> >CDATA, since the spec
> >says that CDATA is "unparsed character data". Seems like parsing it
> >wouldn't fit the description, eh?
> >
>
> No, that's not quite true. there are a number of characters which
> cannot appear in a CDATA section. These include many C0 controls such
> as null and vertical tab, unmatched halves of surrogate pairs, and a
> few other undefined code points. The three character sequence ]]> is
> also illegal.
I would like to take this opportunity to remind everyone that MIME exists
for a reason. When all you have a is a hammer (XML) everything looks like
a thumb (stuffing random binary in XML).
-bob
More information about the jdom-interest
mailing list