I need somebody to write a script that scrapes specific content from two websites and displays it on mine, based on user input submitted through a form post on my website. One of the websites has a simple image math captcha (like 13+4=?).
I have already put together a PHP cURL script that gets the content from these websites. What is missing is the captcha-breaking part and some other improvements, and that's where I need help. The response from PHP cURL needs to be parsed with XPath, formatted properly, and presented to the end user.
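To illustrate the XPath formatting step, here is a minimal sketch using PHP's built-in DOMDocument and DOMXPath classes. The helper name extractByXPath, the sample markup, and the XPath query are all assumptions made up for illustration; the real pages will need their own queries.

```php
<?php
// Hypothetical helper: return the trimmed text of every node matching an XPath query.
function extractByXPath(string $html, string $query): array
{
    $doc = new DOMDocument();
    // Real-world pages are rarely valid HTML; collect parse warnings instead of printing them.
    $previous = libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xpath = new DOMXPath($doc);
    $nodes = $xpath->query($query);
    $results = [];
    if ($nodes === false) {
        return $results; // malformed XPath expression
    }
    foreach ($nodes as $node) {
        $results[] = trim($node->textContent);
    }
    return $results;
}

// Stand-in for the body returned by curl_exec(); this markup is invented.
$html = '<html><body><table id="result"><tr>'
      . '<td class="name">Alice</td><td class="status">Active</td>'
      . '</tr></table></body></html>';

$rows = extractByXPath($html, '//table[@id="result"]//td');
foreach ($rows as $value) {
    // Escape before showing scraped text to the end user.
    echo htmlspecialchars($value, ENT_QUOTES, 'UTF-8'), "\n";
}
```

Escaping the extracted text with htmlspecialchars before output keeps markup from the remote site from being injected into the page shown to the user.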
Additional Project Description:
01/12/2013 at 2:51 EST
Guys, please bid only if you are well versed in the requirements, and bid sensibly. I am a programmer myself, and let me tell you this is not a simple web scraping script. You need to be good with the following:
1) Cookie and session management between getting the captcha and posting the result,
2) Breaking the captcha,
3) Bypassing rate limitation, where certain websites restrict you from making another POST request within a certain period of time,
4) Using IP spoofing or something similar so that my IP doesn't get blocked while doing this scraping.
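For item 1, a minimal sketch of keeping one cookie session across a GET and a following POST with PHP cURL might look like this. The function name fetchThenPost and its parameters are assumptions for illustration, not part of the existing script. It reuses a single handle so that cookies set by the first response are sent with the second request.

```php
<?php
// Hypothetical helper: GET one page, then POST a form in the same cookie session.
function fetchThenPost(string $getUrl, string $postUrl, array $fields): string
{
    $ch = curl_init();
    curl_setopt_array($ch, [
        CURLOPT_RETURNTRANSFER => true,
        CURLOPT_FOLLOWLOCATION => true,
        CURLOPT_COOKIEFILE     => '',  // empty string enables in-memory cookie handling
        CURLOPT_TIMEOUT        => 30,
    ]);

    // Step 1: the first request receives the session cookie.
    curl_setopt($ch, CURLOPT_URL, $getUrl);
    if (curl_exec($ch) === false) {
        throw new RuntimeException('GET failed: ' . curl_error($ch));
    }

    // Step 2: the same handle sends that cookie back with the form post.
    curl_setopt_array($ch, [
        CURLOPT_URL        => $postUrl,
        CURLOPT_POST       => true,
        CURLOPT_POSTFIELDS => http_build_query($fields),
    ]);
    $body = curl_exec($ch);
    if ($body === false) {
        throw new RuntimeException('POST failed: ' . curl_error($ch));
    }
    return $body;
}
```

A call would look like fetchThenPost('https://example.com/form', 'https://example.com/submit', ['field' => 'value']), where the URLs and field names are placeholders. If the session has to survive across separate PHP requests, CURLOPT_COOKIEJAR with a per-user file path is the usual alternative to the in-memory store.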
As I explained in the requirements, I already have 60-70% of the script ready. The rest needs to be completed by somebody who is really good at the things listed above.