[jdom-interest] Questions regarding implementation of DocTyp
e.internalSubset[eg]
philip.nelson at omniresources.com
philip.nelson at omniresources.com
Wed Jun 13 14:03:12 PDT 2001
> > * what to do (if anything) about character entities in the
> source doc like
> > <!ENTITY Ouml 'Ö'>
> >
> > The parser turns this into a String from the parsed entity
> and that is what
> > gets output.
>
> Try to create a string as close to the original as possible.
I think all the other questions have been addressed and I agree with the
conclusions from Jason and Harry. This one is not so easy. The problem is,
we have no way of knowing whether the original data came from a character
entity or not. The parser doesn't report this in any way I could find. we
just get a Unicode string.
Harry, off the top of my head, I have modified DocType, XMLOutputter,
SAXBuilder and SAXHandler. None of this is in CVS. I'll send you what I
have tonight and if you could do the same, we may save a few steps. My code
seemed to work pretty well except for the issues I have brought up.
Could you send yours?
More information about the jdom-interest
mailing list