I am looking to get a scrapper build to extract Name and phone number from classified websites
The scraper / spider should go automatically when started to visit the following websites,
visit listings one by one from the car section and extract names, phone number and email if available. Numbers starting with “2” or “5” should be ignored. A text file should be generated with comma separated values to import the data easy into another program. The software also has to check for duplicate entries and has to be able to run a “Blacklist” to automatically remove numbers from people who choose to be excluded. There should be the option to split the file into smaller parts / lists. The scraper has to be able to ignore errors and keep working without to get stuck for some reason. Just in case one of the website changes their layout or something else.
I have to be able to watch the progress while the program is running, it doesn't really matter the time the scraper needs to get all the info, I can run this over night if need be. Please be prepared for a small add-on or change if something come into my mind while we do this.
I had something like this done a couple of years back but much changed and it’s not working anymore. Also the previous version was simpler and didn't have all the requested features at the time. I think I can dig up the source code if needed to help the new coder.
The Coder has to create a free standing application or program, I am not a programmer or super user, I cannot install a DB or a PHPadmin, a Web server program or something like this. This has to be very simple for me to use and once working, I would like the coder to be available for future addons or changes to the program which of course will be paid separate. The code for the program or application has to get delivered at completing the task. Before you accept, please visit all the websites and make sure you understand what's involved in this. if you are not sure, asked First ! Good luck to all Bidders !