[jdom-interest] SaxBuilder.build(url) and encoding
Jason Hunter
jhunter at servlets.com
Wed Dec 11 21:34:25 PST 2002
When you use a URL the underlying parser determines the encoding,
typically by looking at the declaration. If there isn't one, it assumes
UTF-8. You can even look at the JDOM code to determine exactly what
goes on. But this issue's with your parser or your file contents. Most
likely you have a wrongly encoded file, since parsers tend to be pretty
good about this.
-jh-
Rodrigo Alvarez wrote:
>
> Hi,
>
> I´ve a problem with encoding and using SaxBuilder.build(url) method.
> It seems that the SaxBuilder.build(url) method does not check for the
> encoding attribute in the xml head tag
> Is this correct, or is it just me?
> The following xml does give a JDOMException[Malformed UTF-8 char] when read
> by SaxBuilder.build(url) but works
> fine when read by SaxBuilder.build(File) or SaxBuilder.build(String)
>
> <?xml version="1.0" encoding="ISO-8859-1"?>
> <dybox>
> <current_date week_day="mié">
> <day>11</day>
> <month>12</month>
> <year>2002</year>
> <hour>15</hour>
> <minute>32</minute>
> <second>33</second>
> <formated_date>20021211</formated_date>
> </current_date>
> </dybox>
>
> My solution to the problem was read URL stream into a StringBuffer and then
> build Document object.
> I´m not satisfied with this solution so I wonder if someone else have had a
> similar problem or can confim that
> there really is a problem with JDOM.
>
> /Rodrigo
>
> Rodrigo Alvarez
> DyBOX Consulting and Development
>
> Hernando de Aguirre 906 Providencia.
> Santiago, Chile.
> (562) 231 7840
>
> _______________________________________________
> To control your jdom-interest membership:
> http://lists.denveronline.net/mailman/options/jdom-interest/youraddr@yourhost.com
More information about the jdom-interest
mailing list