Delayed WEB Page Response

Posted by aage.gribskov on February 6, 2008, 7:11 pm
I wish to capture data from a Web page e.g.
"http://www.eppraisal.com/PropertyInfo.aspx?a=1215%20Jefferson%20Ave&z=46201"

I am using the LWP modules.
The page responds in three steps and I have succeeded in capturing
only the first.

The page first paints up nicely with "Loading" text in the area of
interest.
After a delay the "Loading" text is replaced with "Calculating".
Shortly thereafter, sometimes apparently instantaneously, the data of
interest appears.

I have tried LWP::UserAgent and LWP::Parallel::UserAgent and capture
only the initial response.
Changing the timeout parameters does not change the behavior.
The callback subroutine indicates the HTML comes in several chunks.
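
For reference, a minimal sketch of the kind of chunk callback described above, using the `:content_cb` option of LWP::UserAgent. The helper name `fetch_chunks` and the 30-second timeout are assumptions for illustration, not part of the original script. Every chunk handed to the callback is a piece of the body of one and the same HTTP response, so joining the chunks yields only the initial HTML.

```perl
use strict;
use warnings;
use LWP::UserAgent;

# Fetch $url and record every chunk handed to the content callback.
# Returns the HTTP::Response object and an array ref of the chunks.
# (fetch_chunks is a hypothetical helper name.)
sub fetch_chunks {
    my ($ua, $url) = @_;
    my @chunks;
    my $response = $ua->get(
        $url,
        ':content_cb' => sub {
            my ($data, $res, $protocol) = @_;
            push @chunks, $data;    # one piece of the same response body
        },
    );
    return ($response, \@chunks);
}

# Usage: perl chunks.pl URL
if (@ARGV) {
    my $ua = LWP::UserAgent->new(timeout => 30);    # timeout is an assumed value
    my ($response, $chunks) = fetch_chunks($ua, $ARGV[0]);
    printf "status: %s, chunks: %d, bytes: %d\n",
        $response->status_line, scalar @$chunks, length join('', @$chunks);
}
```

When a content callback is supplied, the body is delivered to the callback instead of being stored in the response object, so the script has to keep the chunks itself.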

How can the other responses be captured?
The documentation mentions LWP::Parallel::UserAgent::Entry objects
and follow up requests.
Will this be of help?
I have found no documentation of this feature.
Is there any additional documentation or examples?

Thanks,

AAge Gribskov

Posted by Mark Clements on February 7, 2008, 5:00 pm
aage.gribskov@gmail.com wrote:
> I wish to capture data from a Web page e.g.
> "http://www.eppraisal.com/PropertyInfo.aspx?a=1215%20Jefferson%20Ave&z=46201"
>
> I am using the LWP modules.
> The page responds in three steps and I have succeeded in capturing
> only the first.
>
> The page first paints up nicely with "Loading" text in the area of
> interest.
> After a delay the "Loading" text is replaced with "Calculating".
> Shortly thereafter, sometimes apparently instantaneously, the data of
> interest appears.
>
> I have tried LWP::UserAgent and LWP::Parallel::UserAgent and capture
> only the initial response.
> TimeOut parameters do not change the behavior.
> The callback subroutine indicates the HTML comes in several chunks.
>
> How can the other responses be captured?
> The documentation mentions LWP::Parallel::UserAgent::Entry objects
> and follow up requests.
> Will this be of help?
> I have found no documentation of this feature.
> Is there any additional documentation or examples?

It's using JavaScript - which neither LWP nor WWW::Mechanize will
execute - to move between pages. You could try using
Win32::IE::Mechanize or Selenium, but both of these rely on controlling
a running browser.
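
A quick way to confirm this from the LWP side is to check whether the fetched HTML still contains the placeholder text. The following is only a sketch: the helper name `placeholders_in` and the timeout are assumptions, and the placeholder strings are the ones described in the question, which may not match the page's markup exactly.

```perl
use strict;
use warnings;
use LWP::UserAgent;

# Placeholder strings the browser shows before the data arrives
# (taken from the description in this thread; assumed to appear verbatim).
my @PLACEHOLDERS = ('Loading', 'Calculating');

# Return the placeholder strings that occur in the given HTML.
# (placeholders_in is a hypothetical helper name.)
sub placeholders_in {
    my ($html) = @_;
    return grep { index($html, $_) >= 0 } @PLACEHOLDERS;
}

# Usage: perl check.pl URL
if (@ARGV) {
    my $ua       = LWP::UserAgent->new(timeout => 30);    # assumed timeout
    my $response = $ua->get($ARGV[0]);
    die "Request failed: ", $response->status_line, "\n"
        unless $response->is_success;

    my $html = $response->decoded_content;
    $html = $response->content unless defined $html;

    my @found = placeholders_in($html);
    if (@found) {
        print "Placeholder text still present: @found\n";
    }
    else {
        print "No placeholder text found\n";
    }
}
```

If the placeholders are still present, the data of interest is being filled in by script after the initial response, and no LWP timeout setting will make it appear in that response.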

Mark

Posted by damians on February 20, 2008, 12:01 pm
wrote:
> aage.gribs...@gmail.com wrote:
> > I wish to capture data from a Web page e.g.
> > "http://www.eppraisal.com/PropertyInfo.aspx?a=1215%20Jefferson%20Ave&z=46201"
>
> > I am using the LWP modules.
> > The page responds in three steps and I have succeeded in capturing
> > only the first.
>
> > The page first paints up nicely with "Loading" text in the area of
> > interest.
> > After a delay the "Loading" text is replaced with "Calculating".
> > Shortly thereafter, sometimes apparently instantaneously, the data of
> > interest appears.
>
> > I have tried LWP::UserAgent and LWP::Parallel::UserAgent and capture
> > only the initial response.
> > TimeOut parameters do not change the behavior.
> > The callback subroutine indicates the HTML comes in several chunks.
>
> > How can the other responses be captured?
> > The documentation mentions LWP::Parallel::UserAgent::Entry objects
> > and follow up requests.
> > Will this be of help?
> > I have found no documentation of this feature.
> > Is there any additional documentation or examples?
>
> It's using javascript - which neither LWP nor WWW::Mechanize will
> execute - to move between pages. You could try using
> Win32::IE::Mechanize or Selenium, but both of these rely on controlling
> a running browser.
>
> Mark

There is an API to some of our data. What data elements are you
looking to pull?

Send me an email, or write to info (at) eppraisal.com. Scraping the
front-end is time-consuming and prone to errors (when we push out updates).

Damian (from eppraisal.com)
