|
Posted by dummy on February 28, 2008, 5:03 am
Please log in for more thread options
On Wed, 27 Feb 2008 14:12:56 -0800 (PST), j ellings
>Hello.
>
>I have an html file converted from PDF that includes the following
>sample lines:
>
[snip]
>
>I am trying to capture the information between the <i><b>
>tags as these are the only unique delimiters between entries.
>
>My regex is as follows:
>
>while ($html =~ mgs) {
>#do something
>}
>
>Unfortunately, the regex will match the first instance( Z & A
>Newsstand), but ignore the second (Newstand) and then match on the
>third (Pudgies Deli).
>
>I can see that the match is working according to what I wrote; I am
>trying to fine tune it so that I can grab every match. Is there a way
>to include the previous <i><b> in the next match such that
>it will not skip a potential match?
>
>Any suggestions or advice would be most appreciated.
>
>John
while ($html =~ mg){print "$1\n"}
works for me
|