
PDF::API2 - Extracting text and position from PDF file

Posted by Brian Miller on April 17, 2008, 9:20 am
Hello All,

I am new to PDF files, so I don't really know whether what I want to do is
possible or how to use the PDF::API2 module to do it.

I need to extract information from columns in a table (I assume that
PDF does not know anything about tables). What I was thinking of doing
was finding the horizontal location of the header (I know what it should
be), then extracting all text that starts at that location.
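
To make the idea concrete, here is a rough sketch of just the filtering
step. It is written in Python only to show the logic, and it assumes the
text fragments and their start positions have already been pulled out of
the PDF by some other means (that part is not shown). The Fragment
structure, the column_under_header function, the tolerance value and the
sample data are all made up for illustration.

```python
from typing import List, NamedTuple


class Fragment(NamedTuple):
    """One piece of text and where it starts on the page (hypothetical structure)."""
    x: float
    y: float
    text: str


def column_under_header(fragments: List[Fragment], header: str,
                        tolerance: float = 2.0) -> List[str]:
    """Return the text of fragments that start at the header's x position, below it."""
    # Find the first fragment whose text matches the known header.
    head = next((f for f in fragments if f.text == header), None)
    if head is None:
        return []
    # Assumption: default PDF coordinates, where y grows upward,
    # so "below the header" means a smaller y value.
    cells = [f for f in fragments
             if f is not head
             and abs(f.x - head.x) <= tolerance
             and f.y < head.y]
    # Order the cells from the top of the page to the bottom.
    cells.sort(key=lambda f: -f.y)
    return [f.text for f in cells]


# Made-up sample data standing in for fragments extracted from a real page.
SAMPLE = [
    Fragment(72.0, 700.0, "Name"),
    Fragment(200.0, 700.0, "Qty"),
    Fragment(72.0, 680.0, "Widget"),
    Fragment(200.5, 680.0, "3"),
    Fragment(72.0, 660.0, "Gadget"),
    Fragment(199.8, 660.0, "12"),
]

if __name__ == "__main__":
    print(column_under_header(SAMPLE, "Qty"))
```

The tolerance is there because I would not expect every cell to start at
exactly the same x position as the header. Right-aligned or centred
columns would need a different test than this one.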

I have played around with the PDF::API2 module and read the 'Using
PDF::API2 - The code' help page, but it doesn't show me how to
extract information from an existing file. Could someone point me in the
right direction for some documentation or examples of how this might be
done, or tell me whether it can be done at all?

Thanks in advance,
Brian
