Data extraction/crawling from yelp and opentable
This project received 21 bids from talented freelancers with an average bid price of $654 USD.Get free quotes for a project like this
Project Budget$250-$750 USD
Hello, I need a web crawler to download and parse data from [url removed, login to view] and www.opentable.com. Mainly the product ratings, text and reviewer characteristics. You can send me the csv files or sql database file.
You will first need to match restaurants for the two websites in 6 major cities: New York, LA, Chicago, Houston, Philadelphia, Pheonix
After achieving matches, you will need to download variables such as ratings.
Variables to download (Yelp):
(a) Merchant Data: merchant name, website URL, Zip code, State, city, hours of operation, business website URL, parking, wifi, merchantID
(b) Review Data: Repeated for the same merchant (sort by time stamp): time stamp, rating, text, words, check-in, useful, funny, cool, reviewID, reviewerID
(c) Reviewer Data: number of friends, number of reviews, number of tips, number of fans, number of local photos, tenure (months since join), location (state and city), average of ratings (may be difficult), elite, reviewerID
Variables to download (opentable):
(a) Merchant Data: merchant name, website URL, merchantID
(b) Review Data: Repeated for the same merchant (sort by time stamp): time stamp, overall_rating, food_rating, ambiance_rating, service_rating, text, words, helpful, unhelpful, reviewID, reviewerID
(c) Reviewer Data: review count, first review date, featured reviews, average rating, reviewerID
Feel free to ask any questions.
Looking to make some money?
- Set your budget and the timeframe
- Outline your proposal
- Get paid for your work
Hire Freelancers who also bid on this project
Looking for work?
Work on projects like this and make money from home!Sign Up Now
- The New York Times
- Wall Street Journal
- Times Online