Click here to get back home

HTML::TreeBuilder eating my entities using perl 5.8.x

 HomeNewsGroups | Search | About
 comp.lang.perl.modules    Post an article   get this group's latest topics as an RSS feed add this group's latest topics to your My MSN content add this group's latest topics to your My Yahoo content
Subject Author Date
HTML::TreeBuilder eating my entities using perl 5.8.x Michael Michalowski 05-18-2005
Get Chitika Premium
Posted by Michael Michalowski on May 18, 2005, 3:08 am
Please log in for more thread options


hi there!

I'm using HTML-TreeBuilder to parse HTML code that actually contains
entities in it's text nodes. Working with perl 5.6.1 and the newest
HTML::TreeBuilder/HTML::Tree/HTML::Parser versions was satisfying, but
using perl 5.8.4 there is a problem with a different entity treatment.
Perl 5.8.4 translates the entities in the text nodes (such as ⚔)
into unicode characters (2 bytes) and actually doesn't ask me before
;-)
Is there a possibility to make HTML::TreeBuilder with perl 5.6.x and
perl 5.8.x react the same way (storing entities as they are)?

best regards,
Michael



Similar ThreadsPosted
Possible Issue with HTML::TreeBuilder? July 5, 2005, 6:49 am
HTTP::TreeBuilder problems July 29, 2004, 12:19 am
How can I make entities Not expanded by XML::DOM December 12, 2006, 11:28 am
[ANNOUNCE] MathML::Entities::Approximated (take II) December 13, 2005, 7:03 pm
HTML to XML in Perl? May 12, 2006, 7:05 am
Quick Perl, HTML, CSS, JavaScript reference April 26, 2006, 10:34 am
HTML::Mason, mod_perl on Win32 w/ActiveState Perl December 21, 2004, 11:46 pm
CGI::StringDB Embedding perl data structures in an HTML post. July 10, 2005, 9:13 pm
[RFC] HTML::Dashboard (Spreadsheet-like formatting for HTML tables) April 16, 2007, 4:50 pm
I want an perl module for conver large html page file to multi little pages November 14, 2004, 3:02 am

Our other projects:

Art Dolls, Fairies and Mermaids - Sunnyfaces.net

Roy's Linux, Programming and Search Engines messages

1-Script XML SitemapXML Sitemap