Re: HTML Parsing issues - Part II

Posted by Ben Bullock on May 25, 2008, 8:32 am
On Sat, 24 May 2008 23:20:05 -0700, chadda wrote:

> <table class="item_description">
^^^^^
> Acer
> Aspire AS5610-2089 Notebook, Intel Pentium Dual Core
> T2080, 1.6 GHz, 1024GB, 160GB, DVD+/-R DL/DVD+RW Drive, 15.4" TFT,
> WebCam, 56K Modem, Wireless, NIC, Vista Home Premium, Refurbished with
> 90 Day Warranty</td>

The cell text contains not even one colon, so a pattern that requires one cannot match it.

> if ( lc $token->[1] eq 'item_description' ) {
^^
> while ( $cell =~ /\s*([^:]+?):\s+(\d+)\s+/g ) {
                               ^
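
A minimal sketch of the two points marked above, using core Perl only. It assumes the original code read tokens with HTML::TokeParser's get_token, whose start-tag tokens have the shape ["S", $tag, $attr, $attrseq, $text]; the $token built by hand here is a stand-in for that, not output from the real parser. (With get_tag the array is [$tag, $attr, $attrseq, $text], so element 1 would be the attribute hash, and the comparison would fail just the same.) The $sample string is hypothetical, made up only to show what kind of text the pattern does match.

```perl
use strict;
use warnings;

# Hand-built stand-in for a start-tag token as get_token documents it:
# ["S", $tag, $attr, $attrseq, $text]. Assumption: the original used get_token.
my $token = [
    'S',
    'table',
    { class => 'item_description' },
    ['class'],
    '<table class="item_description">',
];

# The quoted test looks at element 1, the tag name ("table"), so it is false.
my $tag_matches = ( lc $token->[1] eq 'item_description' ) ? 1 : 0;

# The class value lives in the attribute hash at element 2.
my $class_matches =
    ( lc( $token->[2]{class} // '' ) eq 'item_description' ) ? 1 : 0;

# Cell text from the quoted post, joined onto one line.
my $cell = join ' ',
    'Acer Aspire AS5610-2089 Notebook, Intel Pentium Dual Core',
    'T2080, 1.6 GHz, 1024GB, 160GB, DVD+/-R DL/DVD+RW Drive, 15.4" TFT,',
    'WebCam, 56K Modem, Wireless, NIC, Vista Home Premium, Refurbished with',
    '90 Day Warranty';

# The quoted pattern needs "label: number " pairs; this text has no colon.
my @pairs;
while ( $cell =~ /\s*([^:]+?):\s+(\d+)\s+/g ) {
    push @pairs, [ $1, $2 ];
}

# Hypothetical text in the shape the pattern expects.
my $sample = 'Price: 499 Stock: 12 ';
my @sample_pairs;
while ( $sample =~ /\s*([^:]+?):\s+(\d+)\s+/g ) {
    push @sample_pairs, [ $1, $2 ];
}

printf "tag test: %d, class test: %d\n", $tag_matches, $class_matches;
printf "pairs in cell: %d, pairs in sample: %d\n",
    scalar @pairs, scalar @sample_pairs;
```

So the if block is never entered for this table, and even if it were, the while loop would run zero times on this description. The pattern would have to be rewritten for comma-separated text rather than "label: number" pairs.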
