[jdom-interest] CDATA inconsistency

bob mcwhirter bob at werken.com
Sat Nov 2 08:47:38 PST 2002


> At 11:08 PM -0800 11/1/02, Malachi de AElfweald wrote:
> >It would be against XML spec to check the characters within the 
> >CDATA, since the spec
> >says that CDATA is "unparsed character data". Seems like parsing it 
> >wouldn't fit the description, eh?
> >
> 
> No, that's not quite true. There are a number of characters which 
> cannot appear in a CDATA section. These include many C0 controls such 
> as null and vertical tab, unmatched halves of surrogate pairs, and a 
> few other undefined code points. The three character sequence ]]> is 
> also illegal.
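
For reference, the restrictions quoted above can be sketched in Java
roughly as follows. This is only an illustration based on the Char
production in XML 1.0, not JDOM's own checking code; the names
CdataCheck, isXmlChar and isLegalCdata are made up for this example.

```java
// Hypothetical helper, not part of JDOM: checks whether a string could
// be written as the content of a single XML 1.0 CDATA section.
final class CdataCheck {

    // XML 1.0 Char production:
    // #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]
    static boolean isXmlChar(int cp) {
        return cp == 0x9 || cp == 0xA || cp == 0xD
            || (cp >= 0x20 && cp <= 0xD7FF)
            || (cp >= 0xE000 && cp <= 0xFFFD)
            || (cp >= 0x10000 && cp <= 0x10FFFF);
    }

    static boolean isLegalCdata(String text) {
        // The terminator sequence cannot appear inside the section.
        if (text.contains("]]>")) {
            return false;
        }
        for (int i = 0; i < text.length(); ) {
            // For an unmatched surrogate, codePointAt returns the lone
            // surrogate value, which falls outside the ranges above.
            int cp = text.codePointAt(i);
            if (!isXmlChar(cp)) {
                return false;
            }
            i += Character.charCount(cp);
        }
        return true;
    }
}
```

Markup characters such as < and & pass this check, since they are
not interpreted inside CDATA; null, vertical tab, lone surrogates and
the ]]> sequence do not.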

I would like to take this opportunity to remind everyone that MIME exists
for a reason.  When all you have is a hammer (XML) everything looks like
a thumb (stuffing random binary in XML).

	-bob



