I am building a JDOM tree from some HTML and it is barfing with an error "Unconvertable UTF character". The problem seems to be caused by the soft-hyphen "shy" character ­ as well as the copyright symbol character. Can anyone shed any light on why these would cause JDOM SAXBuilder a problem?