|
Posted by H. Wade Minter on January 6, 2005, 8:52 pm
Please log in for more thread options
I'm looking for a Perl module that will give me the text from a page of a simple
(uncompressed, unencrypted) PDF. I've found several modules on CPAN that will
write text into PDFs, but nothing to get it out.
The closest possibilities look like PDF::API2 and Text::PDF. I've been working
with them, and they seem to be able to get at a lot of meta-information in a
PDF, but
unable to get at the actual text in the file.
My workaround is to shell out to pdftotext to get the text, but I'd like to have
a pure-perl solution if possible. Does anyone know of a module that can do this?
Thanks,
Wade
|