This project received 17 bids from talented freelancers with an average bid price of $376 USD.

Get free quotes for a project like this
Employer working
Project Budget
$250 - $750 USD
Total Bids
Project Description

We are interested in extract text from PDF special files ( image files) . OCR is necessary. The PDF files are obtained from a .xps image using a virtual printer ( CUTE PDF). As you will see on the example, each page has two columns. Using ABBYY for example some times the columns are mixed. We are interested in scapping the text without mixing the columns. We will need a automatization process and a long therm collaboration because there are about 30-40 from this documnents daily.

So, we need to extract correctly text from this PDF . The language is Romanian. The text can be saved in one file/ document.

Please find out a PDF attached for an example.

Best regards


Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online