<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 TRANSITIONAL//EN">

<HTML>

<HEAD>

  <META HTTP-EQUIV="Content-Type" CONTENT="text/html; CHARSET=UTF-8">

  <META NAME="GENERATOR" CONTENT="GtkHTML/3.0.9">

</HEAD>

<BODY>

While a SAX parser is at liberty to supply any number of character events in a sequence, I was under the impression that a DOM should never contain multiple, sequential text nodes. Could this not be a bug in JDOM's character event collation?<BR>

<BR>
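To illustrate the kind of collation in question, here is a minimal sketch of a SAX handler that merges consecutive character events into a single string. The class name CoalescingTextHandler and its getLastText method are hypothetical and are not part of JDOM or SAX. The sketch only handles leaf elements, and main simulates the parser by calling the handler directly with the split reported below.<BR>
<PRE>
```java
import org.xml.sax.Attributes;
import org.xml.sax.helpers.AttributesImpl;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical handler (not part of JDOM): merges consecutive character
// events so that the text of a leaf element is seen as one string.
class CoalescingTextHandler extends DefaultHandler {
    private final StringBuilder buffer = new StringBuilder();
    private String lastText = "";

    @Override
    public void startElement(String uri, String localName, String qName, Attributes atts) {
        buffer.setLength(0); // start a fresh run of text for this element
    }

    @Override
    public void characters(char[] ch, int start, int length) {
        // A parser may deliver one run of text in any number of calls.
        buffer.append(ch, start, length);
    }

    @Override
    public void endElement(String uri, String localName, String qName) {
        lastText = buffer.toString(); // the run is only complete here
        buffer.setLength(0);
    }

    String getLastText() {
        return lastText;
    }

    public static void main(String[] args) {
        CoalescingTextHandler handler = new CoalescingTextHandler();
        // Simulate a parser that splits "Test description" into two events.
        handler.startElement("", "long_desc", "long_desc", new AttributesImpl());
        handler.characters("Test d".toCharArray(), 0, 6);
        handler.characters("escription".toCharArray(), 0, 10);
        handler.endElement("", "long_desc", "long_desc");
        System.out.println(handler.getLastText()); // prints Test description
    }
}
```
</PRE>
A builder that buffers in this way would create one text node per run, however the parser chooses to chunk the data.<BR>
<BR>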

Phil :n.<BR>

<BR>

On Wed, 2004-02-04 at 22:03, Robin Rosenberg wrote:

<BLOCKQUOTE TYPE=CITE>

<PRE><FONT COLOR="#737373"><I>This is the result of the underlying SAX parser, not JDOM itself,

which splits the character data. It is formally correct, and very annoying. 

I had never encountered this until I started parsing huge files.

-- robin

On Wednesday 04 February 2004 22:47, Srinivasan Pillai wrote:

&gt; I have a quite large XML file (around 1 MB). I used JDOM to modify the

&gt; contents (using XMLOutputter). When I do so for large files, a CDATA

&gt; section divides itself into 2 or more CDATA sections even though I make no

&gt; change to it.

&gt; For example, this is the original CDATA section I had in the file:

&gt;

&gt; &lt;long_desc&gt;&lt;![CDATA[Test description]]&gt;&lt;/long_desc&gt;

&gt;

&gt; and this is what I saw after the file was modified

&gt; &lt;long_desc&gt;&lt;![CDATA[Test d]]&gt;&lt;![CDATA[escription]]&gt;&lt;/long_desc&gt;

[...]

</I></FONT></PRE>

</BLOCKQUOTE>

<PRE><TABLE CELLSPACING="0" CELLPADDING="0" WIDTH="100%">

<TR>

<TD>

-- <BR>

Phil Weighill-Smith &lt;<A HREF="mailto:phil.weighill-smith@volantis.com"><U>phil.weighill-smith@volantis.com</U></A>&gt;<BR>

Volantis Systems

</TD>

</TR>

</TABLE>

</PRE>

</BODY>

</HTML>