SV: [jdom-interest] JDOM and text outside tags

Per Norrman pernorrman at telia.com
Wed Oct 22 23:41:42 PDT 2003


Hi,

getText() on the SMALL element should get you
a concatenated string, " some_text  some_other_text ".

If you want the pieces individually, you have to
iterate through the content of the SMALL element and
pick up the the value of each Text node in question.

/pmn

> -----Ursprungligt meddelande-----
> Från: jdom-interest-admin at jdom.org 
> [mailto:jdom-interest-admin at jdom.org] För Jacques-Albert De Blasio
> Skickat: den 22 oktober 2003 11:06
> Till: jdom-interest at jdom.org
> Ämne: [jdom-interest] JDOM and text outside tags
> 
> 
> Hi all,
> 
> I have a problem with JDOM and I am sure that one of you JDOM 
> guru could 
> help me out :)
> 
> In a program I'm writing, I first fetch HTML pages on the 
> web, tidy them 
> with NekoHTML (JTidy was not sufficient as it could not parse 
> japanese 
> html pages) and then transform the DOM outputed by NekoHTML into JDOM 
> documents.
> 
> My problem is the following: in a given page, I have tags such as
> 
> <TD>
> <SMALL>
> <IMG src = "..." /> some_text <BR />
> <IMG src =" ..." /> some_other_text <BR />
> </SMALL>
> </TD>
> 
> How can I fetch the "some_text" and "some_other_text" ?
> 
> Thank you very much for your help,
> 
> Jack
> 
> _______________________________________________
> To control your jdom-interest membership: 
> http://lists.denveronline.net/mailman/options/jdom-interest/yo
uraddr at yourhost.com




More information about the jdom-interest mailing list