Scraper / Spider / Extractor

Avg Bid (USD)
Project Budget (USD)
$30 - $100

Project Description:
Brief summary:
I am looking to get a scrapper build to extract Name and phone number from classified websites

Detailed Requirements:
The scraper / spider should go automatically when started to visit the following websites,

visit listings one by one from the car section and extract names, phone number and email if available. Numbers starting with “2” or “5” should be ignored. A text file should be generated with comma separated values to import the data easy into another program. The software also has to check for duplicate entries and has to be able to run a “Blacklist” to automatically remove numbers from people who choose to be excluded. There should be the option to split the file into smaller parts / lists. The scraper has to be able to ignore errors and keep working without to get stuck for some reason. Just in case one of the website changes their layout or something else.

I have to be able to watch the progress while the program is running, it doesn't really matter the time the scraper needs to get all the info, I can run this over night if need be. Please be prepared for a small add-on or change if something come into my mind while we do this.

I had something like this done a couple of years back but much changed and it’s not working anymore. Also the previous version was simpler and didn't have all the requested features at the time. I think I can dig up the source code if needed to help the new coder.

Skills required:
PHP, Software Architecture
About the employer:
Public Clarification Board
Bids are hidden by the project creator. Log in as the employer to view bids or to bid on this project.
You will not be able to bid on this project if you are not qualified in one of the job categories. To see your qualifications click here.

Hire mantislin
$ 248
in 5 days
$ 100
in 4 days
Hire Days
$ 100
in 1 days
$ 99
in 5 days
$ 200
in 10 days
Hire johnred332
$ 100
in 3 days