Extract data from online PDF files into database

This project received 13 bids from talented freelancers with an average bid price of $210 USD.

Get free quotes for a project like this
Project Budget
Total Bids
Project Description


I am looking for an experienced Web Developer with advanced PDF extraction skills.

My objective is to have a script that can regularly visit a 12 different websites and check an URL on that page to see if a new PDF has been made available.

For every updated PDF the script needs to extract all data and insert into a MySQL DB and keep track of the insert in the form of a batch system using a date identifier of the last batch.

The new batch information must also become visible in the Admin panel.

Please contact me so I can send you a .xls with details about the websites and the URL that point to the PDF.

Note: the URL destination can change every time a new PDF is made available there the script must be “smart” in order to detect updated PDF files. Also the format of the PDF might change in the future. Therefore the script needs to be scalable and adaptable against low costs changes.

Skills required:

- Advanced PDF Extraction experience



Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online