I am looking to have a scraper built to extract names and phone numbers from classified websites.
Once started, the scraper/spider should automatically go to the following websites, visit the listings in the car section one by one, and extract names, phone numbers, and email addresses if available. Numbers starting with “2” or “5” should be ignored. A text file with comma-separated values should be generated so the data can be imported easily into another program. The software must also check for duplicate entries and support a “Blacklist” that automatically removes the numbers of people who have chosen to be excluded. There should be an option to split the file into smaller parts/lists. The scraper must be able to ignore errors and keep working without getting stuck, for example if one of the websites changes its layout.
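To make the clean-up rules above concrete, here is a rough sketch of how the filtering, duplicate check, blacklist, and file splitting could fit together. It is an illustration only, not a specification: the function names, the `name, phone, email` column order, and the output file naming are hypothetical, and it assumes phone numbers are compared by their digits alone with no country-code prefix in front.

```python
import csv
import re


def normalize_phone(raw):
    """Strip everything except digits so the same number always compares equal."""
    return re.sub(r"\D", "", raw)


def clean_records(records, blacklist, ignored_prefixes=("2", "5")):
    """Drop ignored-prefix, blacklisted, and duplicate numbers; keep first occurrences.

    Assumption: each record is a (name, phone, email) tuple of strings.
    """
    blocked = {normalize_phone(number) for number in blacklist}
    seen = set()
    kept = []
    for name, phone, email in records:
        digits = normalize_phone(phone)
        # Skip empty numbers and numbers starting with an ignored digit.
        if not digits or digits.startswith(ignored_prefixes):
            continue
        # Skip people on the blacklist and numbers already collected.
        if digits in blocked or digits in seen:
            continue
        seen.add(digits)
        kept.append((name.strip(), digits, email.strip()))
    return kept


def write_csv_parts(rows, prefix, rows_per_file):
    """Write rows into numbered CSV files of at most rows_per_file rows each."""
    paths = []
    for start in range(0, len(rows), rows_per_file):
        path = f"{prefix}_{start // rows_per_file + 1}.csv"
        with open(path, "w", newline="", encoding="utf-8") as handle:
            writer = csv.writer(handle)
            writer.writerow(["name", "phone", "email"])
            writer.writerows(rows[start:start + rows_per_file])
        paths.append(path)
    return paths
```

For example, `write_csv_parts(clean_records(records, blacklist), "leads", 500)` would produce `leads_1.csv`, `leads_2.csv`, and so on, each with a header row and up to 500 entries. Keeping the first occurrence of a duplicate is a design choice of this sketch; the actual rule is open to discussion.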
I need to be able to watch the progress while the program is running. It doesn't really matter how long the scraper takes to collect all the info; I can run it overnight if need be. Please be prepared for a small add-on or change if something comes to mind while we work on this.
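As a rough idea of what "watch the progress" and "keep working without getting stuck" could look like in practice, the sketch below runs a handler over a list of items, logs and skips any item that fails, and reports progress at intervals. The names `process_all`, `handler`, and `report_every` are hypothetical, and the handler stands in for whatever code processes a single listing.

```python
import logging

logging.basicConfig(level=logging.INFO, format="%(asctime)s %(levelname)s %(message)s")
log = logging.getLogger("scraper")


def process_all(items, handler, report_every=10):
    """Run handler on every item, skipping failures and logging progress."""
    results = []
    failures = 0
    total = len(items)
    for index, item in enumerate(items, start=1):
        try:
            results.append(handler(item))
        except Exception:
            # One bad item (for example, a changed page layout) must not stop the run.
            failures += 1
            log.exception("Skipping item %r", item)
        # Report every report_every items and once more at the very end.
        if index % report_every == 0 or index == total:
            log.info("Progress: %d/%d done, %d failed", index, total, failures)
    return results, failures
```

Logging to the console is only one option; a progress bar or a small status window would serve the same purpose.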
I had something like this built a couple of years ago, but a lot has changed and it no longer works. The previous version was also simpler and didn't have all the features requested here. I think I can dig up the source code if it would help the new coder.