# Question: need to parse web pages to extract data

Hi,

The site is:
www.homepriceguide.com.au

A sample page with data can be seen at:
http://www.homepriceguide.com.au/snapshot/price/index.cfm?action=view&suburbORpostcode=6153&source=apm

The only thing that changes is the postcode so the next page in line will
be:
http://www.homepriceguide.com.au/snapshot/price/index.cfm?action=view&suburbORpostcode=6154&source=apm

etc etc

What I'm trying to do is to extract price info and save it to a file where
each record has the postcode as its ID. Last year I wrote a script that went
through the site and gathered the data for me and dumped the results in a
file. Unfortunately it's gone walking somewhere. Can someone pls remind me
which module is best to be used here (I'm mainly concerned with the parsing
side right now)? I have not coded for <> 12mths so I'm a bit rusty now but
hopefully it will all come back.

Let me know if the above is not clear.

Voitec

## Re: Question: need to parse web pages to extract data

> Hi,
>
> The site is:
> www.homepriceguide.com.au
>
> A sample page with data can be seen at:
> http://www.homepriceguide.com.au/snapshot/price/index.cfm?action=view&suburbORpostcode=6153&source=apm
>
> The only thing that changes is the postcode so the next page in line will
> be:
> http://www.homepriceguide.com.au/snapshot/price/index.cfm?action=view&suburbORpostcode=6154&source=apm
>
> etc etc
>
> What I'm trying to do is to extract price info and save it to a file where
> each record has the postcode as its ID. Last year I wrote a script that went
> through the site and gathered the data for me and dumped the results in a
> file. Unfortunately it's gone walking somewhere. Can someone pls remind me
> which module is best to be used here (I'm mainly concerned with the parsing
> side right now)? I have not coded for <> 12mths so I'm a bit rusty now but
> hopefully it will all come back.

--
Wyzelli
=sub{scalar reverse shift}}=>
=>=>=>=>{}}}}}}

## Re: Question: need to parse web pages to extract data

> What I'm trying to do is to extract price info
---------

Try the LWP::Simple module and its get() function to grab the page source,
then extract what you want from the returned HTML.
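
A rough sketch of that approach, in case it helps. The URL pattern comes from the original post; everything else is an assumption. In particular, the real page markup isn't shown in this thread, so the regex below simply assumes prices appear as dollar amounts like $350,000 and will need adjusting to the actual HTML. The sub names and the output file name prices.txt are made up for illustration.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# URL pattern from the original post; only the postcode changes.
my $base = 'http://www.homepriceguide.com.au/snapshot/price/index.cfm'
         . '?action=view&suburbORpostcode=%s&source=apm';

# ASSUMPTION: prices show up as dollar amounts such as $350,000.
# The real markup is unknown here, so adapt the pattern to the page.
sub extract_prices {
    my ($html) = @_;
    my @prices = $html =~ /\$(\d[\d,]*)/g;   # all captures, in page order
    tr/,//d for @prices;                     # strip thousands separators
    return @prices;
}

# Fetch one postcode page; returns an empty list if the request fails.
sub prices_for_postcode {
    my ($postcode) = @_;
    require LWP::Simple;                     # loaded only when fetching
    my $html = LWP::Simple::get(sprintf $base, $postcode);
    return defined $html ? extract_prices($html) : ();
}

# Only hit the site when postcodes are given on the command line.
if (@ARGV) {
    open my $out, '>', 'prices.txt' or die "Cannot open prices.txt: $!";
    for my $postcode (@ARGV) {
        my @prices = prices_for_postcode($postcode);
        # One tab-separated record per line, postcode first as the ID.
        print {$out} join("\t", $postcode, @prices), "\n";
        sleep 1;                             # be polite to the server
    }
    close $out;
}
```

Run it as, say, `perl prices.pl 6153 6154`. A plain regex is fine for a quick job, but if the prices sit in an HTML table, a real parser such as HTML::TableExtract or HTML::TreeBuilder is likely to be more robust.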

d