scrape data from pdf files
This project received 9 bids from talented freelancers with an average bid price of $167 USD.Get free quotes for a project like this
Browse Related Skills
Other things people do on Freelancer
Project Budget$30 - $250 USD
Attached are 3 sample PDF files.
I want to know if it's possible to scrape the data from the files and store the data as texts in a mysql database.
So what i want is a program i can run myself and possibly adapt over time.
I need to differentiate between the dutch and french pieces of text,
i also need to differentiate between sections and titles and actual text so i can use the titles to scrape what i need and skip the rest.
Please indicate how accurate this scraping can be, what % accuracy can a program achieve and what % will need to be checked manually.
I'd prefer a program in Java but i'll also consider PHP, no other languages.
I am familiar with these languages myself as well as mysql but not with the pdf format.
Looking to make some money?
- Set your budget and the timeframe
- Outline your proposal
- Get paid for your work
Hire Freelancers who also bid on this project
Looking for work?
Work on projects like this and make money from home!Sign Up Now
- The New York Times
- Wall Street Journal
- Times Online