Closed

Text Scraping from pdf to excel spread sheet

This project received 34 bids from talented freelancers with an average bid price of $30 USD.

Get free quotes for a project like this
Employer working
Skills Required
Project Budget
$10 - $30 USD
Total Bids
34
Project Description

This will be an easy project for a programmer who knows how to do it.
I have a pdf with 57,865 records. A screenshot has been uploaded to show the data format. I want the text to be scraped programatically and the data saved in a excel file (xls). Each row of the spreadsheet should correspond to a single record in the pdf document. The fields in the spreadsheet should be as below. Just to know that you have actually read the project description, Please begin your reply with the words "ice-cream soda"

serial number - the first box on the left with numbers
Name - the text in bold
City/Town - The word preceeding (just before) the word "Karnataka"
Address - the words after "Name" and before "City/Town"
Post Code - the numbers after the word "Karnataka" Please note, only some entries have the Post Code. Leave blank where there is no information.
Degree - the third box
Remarks - The last box. Please note, only some entries have the Remarks. Leave blank where there is no information.

The work should be done programatically and not manually. Quick turn-around is required.


Regards

MJ Reddy

Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online