[jdom-interest] Outputting Entity reference for non US-ASCII characters

Benjamin Kopic benjamin.kopic at panContext.com
Fri Oct 17 03:40:01 PDT 2003


Great stuff Alex. It is exactly what I needed (can't wait for 1.0b10 as
it has some really powerfull stuff).

Best regards

Ben

On Thu, 2003-10-16 at 15:14, Alex Rosen wrote:

> If I remember correctly JDOM now (in the current CVS tree, not in beta
> 9) will automatically escape any characters that the output encoding
> can't handle. So if you just tell it to output in US-ASCII, it'll turn
> all chars > 127 into entities.
> 
> (Note that there were some API changes after beta 9 so you'll have to
> do a little work to use the latest code.)
> 
> Alex
> 
> >>> Benjamin Kopic <benjamin.kopic at panContext.com> 10/16/2003 4:37:39
> AM >>>
> Hi
> 
> I need to write some sort of Entity handling routine that converts all
> of the non US-ASCII characters
> to their SGML Entity reference. There was some discussion on this
> subject way back, but I am not sure
> what came out of it. All of the documents I need to produce have to
> comply to the following restriction:
> http://www.ncbi.nlm.nih.gov/entrez/query/static/entities.html
> 
> What would be the best way:
> 
> a) write EntityRef for each one of these and then let JDOM
> XMLOutputter
> do the conversion (I assume it
> does it)
> 
> b) write my own String conversion utility that converts the chars
> outside 127 bit range to their entity
> ref value.
> 
> Actually, what I really would like to know is if JDOM would convert a
> Unicode String to an XML String
> that is valid for a particular encoding (i.e. US-ASCII) simply by
> registering EntityRef for each of
> the characters outside the range for the given encoding?
> 
> Best regards
> 
> Benjamin

-- 
benjamin kopic
m: +44 (0)780 154 7643
t: +44 (0)20 7794 3090
e: benjamin.kopic at panContext.com
w: http://www.panContext.com/
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://jdom.org/pipermail/jdom-interest/attachments/20031017/d0b84839/attachment.htm


More information about the jdom-interest mailing list