scrape data from pdf files

This project received 9 bids from talented freelancers with an average bid price of $167 USD.

Get free quotes for a project like this
Project Budget
$30 - $250 USD
Total Bids
Project Description

Attached are 3 sample PDF files.

I want to know if it's possible to scrape the data from the files and store the data as texts in a mysql database.

So what i want is a program i can run myself and possibly adapt over time.

I need to differentiate between the dutch and french pieces of text,

i also need to differentiate between sections and titles and actual text so i can use the titles to scrape what i need and skip the rest.

Please indicate how accurate this scraping can be, what % accuracy can a program achieve and what % will need to be checked manually.

I'd prefer a program in Java but i'll also consider PHP, no other languages.

I am familiar with these languages myself as well as mysql but not with the pdf format.

Skills Required

Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online