<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 TRANSITIONAL//EN">
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; CHARSET=UTF-8">
<META NAME="GENERATOR" CONTENT="GtkHTML/3.0.9">
While a SAX parser is at liberty to supply any number of character events in a sequence, I was under the impression that a DOM should never contain multiple, sequential text nodes. Could this not be a bug in JDOM's character event collation?<BR>
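For what it's worth, here is a minimal sketch of the kind of collation I mean, done at the SAX level with only the JDK's built-in parser. It is not JDOM's actual builder code; the class name CollatingHandler and the textOf helper are made up for illustration, and it only tracks the text of the most recently closed element.<BR>

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical handler: buffers all character events for an element
// and only hands the text over once the element is closed.
class CollatingHandler extends DefaultHandler {
    // Accumulates every chunk delivered for the current run of text.
    private final StringBuilder buffer = new StringBuilder();
    // Collated text of the most recently closed element.
    private String lastText = "";

    @Override
    public void startElement(String uri, String localName, String qName, Attributes atts) {
        buffer.setLength(0); // a new element starts a new run of text
    }

    @Override
    public void characters(char[] ch, int start, int length) {
        // The parser may call this several times for one logical run of text.
        buffer.append(ch, start, length);
    }

    @Override
    public void endElement(String uri, String localName, String qName) {
        lastText = buffer.toString();
        buffer.setLength(0);
    }

    String getLastText() {
        return lastText;
    }

    // Parses the given XML string and returns the collated text
    // of the last element that was closed.
    static String textOf(String xml) {
        try {
            CollatingHandler handler = new CollatingHandler();
            SAXParserFactory.newInstance().newSAXParser()
                    .parse(new InputSource(new StringReader(xml)), handler);
            return handler.getLastText();
        } catch (Exception e) {
            throw new RuntimeException(e);
        }
    }
}
```

Calling CollatingHandler.textOf on the split example from the original report should give back the single string, however many character events the parser chose to deliver.<BR>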
On Wed, 2004-02-04 at 22:03, Robin Rosenberg wrote:
<PRE><FONT COLOR="#737373"><I>This is the result of the underlying SAX parser, not JDOM itself,
which splits the character data. It is formally correct, and very annoying.
I had never encountered this until I started parsing huge files.
On Wednesday, 04 February 2004 22:47, Srinivasan Pillai wrote:
> I have a quite large XML file (around 1 MB). I used JDOM to modify the
> contents (using XMLOutputter). When I do so for large files, the CDATA
> sections divide themselves into 2 or more CDATA sections even though I make no
> change to them.
> For example, this is the original CDATA section I had in the file,
> <long_desc><![CDATA[Test description]]></long_desc>
> and this is what I saw after the file was modified
> <long_desc><![CDATA[Test d]]><![CDATA[escription]]></long_desc>
</FONT>
<PRE><TABLE CELLSPACING="0" CELLPADDING="0" WIDTH="100%">
Phil Weighill-Smith <<A HREF="mailto:firstname.lastname@example.org"><U>email@example.com</U></A>><BR>