i have a few pdfs that needs to be extracted into xml
the php script must have a upload feature and will need to read scanned image pdf. meaning the pdf is actually a scanned copy. some pdf have 2 pages
the script MUST be able to correctly extract all the content by doing OCR.
things needed from pdf
1) sku or reference number
2) invoice number
5) total amount
take note, there are 6 different types of pdf. meaning 6 different layouts.