[jdom-interest] <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML Basi
c 1.0//EN" "http://www.w3.org/TR/xhtml-basic/xhtml-basic10.dtd"> error?
philip.nelson at omniresources.com
philip.nelson at omniresources.com
Mon Nov 19 23:39:50 PST 2001
>
> This is the only thing that is in the document. These pages
> come from another web application that my company will not
> allow to be modifed.
>
> <!DOCTYPE html
> PUBLIC "-//W3C//DTD XHTML Basic 1.0//EN"
> "http://www.w3.org/TR/xhtml-basic/xhtml-basic10.dtd">
>
> It JTidy has no problem building a DOM object with these
> files, but JDOM throws errors because of the DOCTYPE
> declaration. I need to know how build a JDOM document that
> contains this type of DOCTYPE. I would be happy if I could
> just ingore it.
Jtidy is outputing a DOM with this doctype, correct? If you look at this
doctype, it is full of parameter entities which are then expanded in the
dtd. I don't know this dtd but you can probably expect that there are parts
of the result document that don't match this dtd. So, as far as I can tell
from skimming this thread, Jtidy saw only the original html document,
according to whatever options you set, output a dom with this doctype. In
this document, somehow I don't fully understand, something is interpeted as
a PI instead of an attribute or element. Since this is DOMBuilder, a bug is
not out of the question but what we really need I think is the xml version
of the DOM document Jtidy output.
It could turn out to be a parser bug, a Jtidy bug, a DOMBuilder bug or
something I haven't thought of ;-)
More information about the jdom-interest
mailing list