[OGo-Developer] parsing HTML
Helge Hess
developer@opengroupware.org
Thu, 15 Feb 2007 01:58:56 +0100
On Feb 14, 2007, at 21:45, Wolfgang Sourdeau wrote:
> Again, my answer is that I was not using the correct parser,
> although I think any XML/SGML parser should handle those entities.
Any non-XML entity (like eacute) must be properly defined in the DTD
if you use an XML parser. And for validated XML you need to use the
libxml2DocParser backend, the default parser is just a wellformed XML
parser.
If you use an HTML parser, all the HTML entities are builtin of course.
Greets,
Helge
PS: Not sure why you refer to an SGML parser, we don't provide one?
Not sure which entities are builtin in generic SGML.
--
Helge Hess
http://www.helgehess.eu/