|
Posted by John Bokma on May 12, 2006, 12:34 pm
Please log in for more thread options
> Suppose I want to translate an HTML to a XML-well-formed HTML (so that
> I can, e.g., apply xsltproc to the result). E.g., HTML::TreeBuilder
> can apply "usual heuristics" to parse HTML; how to get XML out of it?
Question: I use XML, not XHTML, at home, and use XML::Twig to convert it
to HTML. I can use xsltproc if I want to on the XML file.
You might want to traverse the parse tree HTML::TreeBuilder generates.
Also, not 100% sure, but it might me that HTML tidy can do the XHTML
conversion for you:
Google...
"Validator fixes errors in HTML and XHTML. Converts HTML to XHTML. Free
Software."
http://www.google.com/search?q=html%20tidy%20xhtml
Sounds like it does :-D.
--
John Bokma Freelance software developer
&
Experienced Perl programmer: http://castleamber.com/
|