[jdom-interest] UTF-8 coding IOException

philip.nelson at omniresources.com philip.nelson at omniresources.com
Fri May 11 07:02:22 PDT 2001


As always with bug reports, include the Xerces version, get the latest jdom
version from CVS and if you can provide a code sample we could run, that
will speed up the response tremendously.

In this case, I would try to use SAXBuilder and see if the problem exists
there.  JDOM has been *very* carefully checked that it will accept non ascii
valid xml characters.  If it really is a bug, it has to do with Xerces
itself or the interface between Xerces DOM and JDOM.

> -----Original Message-----
> From: Michael Tarovik [mailto:tmn at baltics.ru]
> Sent: Friday, May 11, 2001 6:06 AM
> To: jdom-interest at jdom.org
> Subject: [jdom-interest] UTF-8 coding IOException
> 
> 
> Use xerces with 
> <?xml version='1.0' encoding='UTF-8'?><tds>
>   <row><а│аАб┌аА_б┐аБб▀б┌аИб│>09.05.2001
> 17:36:00</а│аАб┌аА_б┐аБб▀б┌аИб│></row>
> </tds>
> where element after <row> realy is (in hex) D094 D0B0 D182 
> D0B0 5F D183
> D0B1 D18B D182 D0B8 D18F
> I've got the next:
> 'Element type "-" must be folloved by either attribute specification?
> ">" or ">" '
>   at org.jdom.adapter.XercesDOMAdapter.getDocument
> So I think it can't corectly read input text.
> By the way this xml was read correctly by IE5.5 and NN 6.
> _______________________________________________
> To control your jdom-interest membership:
> http://lists.denveronline.net/mailman/options/jdom-interest/yo
> uraddr at yourhost.com
> 



More information about the jdom-interest mailing list