Need a multithreaded bot to scrape a website for their inventory. The inventory will need to be updated on hourly basis since their inventory changes regularly. Looking for well written code which defines variables and also with commenting throughout code so another dev can easily navigate code.
1/ Inventory from website to be parsed to MsQL database more quickly accessed when there is a search initiated based on intial 3 criteria inputed by user 2/ Install the script on the website so that users can access relevant search results based on user input of 3 criteria
3/ The script is to be written in PHP so that it integrates well with Wordpress site
4/ Code a multithreadedbot to scrape the data
5/ The script should move the scrapped data to a backup table before starting a fresh scrapping.
Project Success Defined as:
1/ 6000 records/hour is being scrapped from the website and updating MsQL base.
2/ no errors or bugs that result in slow product performance
3/ the records being scraped match website database
4/ the bot is accurate within 1 hour
5/ Admin preselects date/time for scraping of the website
6/ time and date of latest update is visible
7/ any of the scraping actions should not negatively affect the website being scraped of bring their website down
8/ install the script on wordpress site doing the scrapping
Milestone 1: $75 - it is verified that output from script shows to be what is in website database
Milestone 2: $75 - bugs or errors and smooth running script. Two weeks after competion of milestone 1 should suffient time to test and verify that the code is running well
Bonus: $75 if script reaches 10000 records/hr.
9 freelancers are bidding on average $218 for this job
I have had a year of experience just doing web scraping, looking for jobs from around 100 job sites. I've also had experience with multiprocess PHP programs. I know what I'm talking about so lets talk.