Page 1 of 1

Extract information from PDF

Posted: Tue Mar 29, 2011 11:05 pm
by Kleyton
Good evening friends
Does anyone know how to extract certain information from PDF files, via five?

Re: Extract information from PDF

Posted: Wed Mar 30, 2011 4:24 pm
by James Bott
Kleyton,

PDF files are really not designed to be data sources. As you know PDF's can be incredibly complex with mixtures of graphics and text and lots of configuration data.

If your PDFs are simple and always the same format, then perhaps it would be feasible. With complex PDFs with varying formats it could quickly become a monumental task.

James

Re: Extract information from PDF

Posted: Thu Mar 31, 2011 6:15 pm
by Kleyton
I indicated that DLL, also works with activex.

http://www.foxitsoftware.com/shopping/c ... ?suggest=0

Re: Extract information from PDF

Posted: Fri Apr 01, 2011 12:59 am
by Bayron
Maybe if you explain a little better what you are trying to do, someone can help you...

Are you trying to read a PDF file???
Are you trying to edit a PDF file???
Are you trying to fill a PDF form???
Are you trying to extract information about the file???

Re: Extract information from PDF

Posted: Fri Apr 01, 2011 7:27 am
by MarcoBoschi
My little implementation in a existing project.
Pdf documents arrive by email in advance daily.
Their contents are transport documents.
Warehouseman open pdf document using Acrobat reader (what else? ) .
(before you import something into a program I still need to check the contents)

Perform CTRL-A and CTRL-C (the content is copied to the clipboard).

In my program click a NEW button that reads from clipboard the contents and
extracts codes, quantities and prices.

Minimum effort maximum efficiency

Probably this is not an answer to your question, is only an input

bye
marco

Re: Extract information from PDF

Posted: Fri Apr 01, 2011 7:58 am
by Richard Chidiak
try pdf toolkit , free and easy to use (command line)

I don't know what exactly you want to acheive with pdf but this tool offedrs a lot of possibilities, i use it to merge pdfs...etc

http://www.pdflabs.com/tools/pdftk-the-pdf-toolkit/

Hth

Richard