Click here to get back home

RegEx - matching previous match

 HomeNewsGroups | Search | About
 comp.lang.perl.misc    Post an article   get this group's latest topics as an RSS feed add this group's latest topics to your My MSN content add this group's latest topics to your My Yahoo content
Subject Author Date
RegEx - matching previous match j ellings 02-27-2008
Posted by j ellings on February 27, 2008, 5:12 pm
Please log in for more thread options
Hello.

I have an html file converted from PDF that includes the following
sample lines:

(html has been converted)

<i><b>Z & A Newsstand</b></i><br>
<i>Retail Food: Mobile Food Vendor</i><br>
<i>2 N 10th St</i><br>
<i>Philadelphia, PA 19107</i><br>
<b>Inspection Date</b><br>
<i>4/11/07</i><br>
No Critical Violations<br>
<i>4/11/07</i><br>
No Critical Violations<br>
<i>11/28/06</i><br>
No Critical Violations<br>
<i>4/24/06</i><br>
No Critical Violations<br>
<i><b>Newstand</b></i><br>
<i>Retail Food: Mobile Food Vendor</i><br>
<i>32 N 10th St</i><br>
<i>Philadelphia, PA 19107</i><br>
<b>Inspection Date</b><br>
<i>7/2/07</i><br>
No Critical Violations<br>
<i><b>Pudgies Deli</b></i><br>
<i>Retail Food: Restaurant, Eat-in</i><br>
<i>46 N 10th St</i><br>
<i>Philadelphia, PA 19107</i><br>
<b>Inspection Date</b><br>
<i>1/11/07</i><br>
No Critical Violations<br>
<i>9/25/06</i><br>
No Critical Violations<br>
<i>8/7/06</i><br>
No Critical Violations<br>


I am trying to capture the information between the <i><b>
tags as these are the only unique delimiters between entries.

My regex is as follows:

while ($html =~ mgs) {
#do something
}

Unfortunately, the regex will match the first instance( Z & A
Newsstand), but ignore the second (Newstand) and then match on the
third (Pudgies Deli).

I can see that the match is working according to what I wrote; I am
trying to fine tune it so that I can grab every match. Is there a way
to include the previous <i><b> in the next match such that
it will not skip a potential match?

Any suggestions or advice would be most appreciated.

John

Any


Similar ThreadsPosted
RegEx - matching previous match February 27, 2008, 5:12 pm
Multi-Match (to Array) Regex with a precodition match? August 5, 2007, 2:43 pm
Regex not matching May 15, 2005, 4:37 am
REGEX NAME Matching.. June 23, 2005, 5:11 pm
RegEx Help, Please? (match after n) June 26, 2005, 10:49 pm
regex to match any url February 14, 2006, 4:02 pm
Matching substrings within a Regex May 16, 2006, 12:41 pm
regex matching exactly 10 digits November 28, 2006, 8:58 am
get the matching regex pattern March 20, 2008, 9:16 am
regex back matching June 5, 2008, 8:53 am

Our other projects:

Art Dolls, Fairies and Mermaids - Sunnyfaces.net

Roy's Linux, Programming and Search Engines messages

1-Script XML SitemapXML Sitemap