Hi coders, I need a crawler to scrape pages from Archive.org. In particular, I am interested in getting archived Backpage ad postings. The project has 3 steps: 1) collect the URLs of cities from [url removed, login to view], 2) input those URLs on [url removed, login to view], 3) collect the needed information, parse it into the right format, and output it as a CSV file. The attached Word document has the details; please read it before making a bid.
I am on a tight budget and can only offer less than $100. In exchange, I am willing to allow more time for this project. Thanks!
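For bidders, here is a rough sketch of how the three steps might fit together. It is only an illustration under assumptions, not the required design: it assumes the archived snapshots are looked up through the Wayback Machine CDX search endpoint, and the function names, the example city URL, and the CSV columns ("title", "posted") are all made up, since the real URLs and fields are in the attached document. Fetching and HTML parsing are left out on purpose.

```python
import csv
import io
from urllib.parse import urlencode

# Wayback Machine CDX search endpoint (assumed to be the lookup used for step 2).
CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def build_cdx_query(city_url, date_from, date_to):
    """Build a CDX query URL listing snapshots under one city URL.

    date_from / date_to are Wayback-style timestamps such as "20100101".
    """
    params = {
        "url": city_url,
        "matchType": "prefix",       # include pages below the city URL
        "output": "json",
        "from": date_from,
        "to": date_to,
        "filter": "statuscode:200",  # skip redirects and errors
    }
    return CDX_ENDPOINT + "?" + urlencode(params)


def snapshot_urls(cdx_rows):
    """Turn a CDX JSON response (header row + data rows) into snapshot URLs."""
    if not cdx_rows:
        return []
    header, *rows = cdx_rows
    ts = header.index("timestamp")
    orig = header.index("original")
    return [f"https://web.archive.org/web/{row[ts]}/{row[orig]}" for row in rows]


def records_to_csv_text(records, fieldnames):
    """Serialize parsed ad records (dicts) to CSV text; extra keys are ignored."""
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=fieldnames, extrasaction="ignore")
    writer.writeheader()
    writer.writerows(records)
    return out.getvalue()
```

A run would call build_cdx_query once per city URL from step 1, download the JSON it points to, pass the decoded rows to snapshot_urls, parse each snapshot page into a dict, and hand the list of dicts to records_to_csv_text. The date arguments are where any date boundaries from the specification would go.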
11 freelancers are bidding on average $165 for this job
Your main requirements are clear. We can do this for you and we have mentioned how we are going to do this in the message we sent you. Please go through it and contact us for more details. Solution Infinity.
From the specifications ([url removed, login to view]), this looks like a fairly straightforward crawling/extraction task with a few twists (date boundaries, etc.).
Hi Sir. I have experience with web crawling (I did a project crawling information from [url removed, login to view]). Please check your PM for more information. Thanks.