We will be scraping activity from a forum using the publicly available json-based API. Please open the below Google spreadsheet for the sample file, layout, etc...
https://docs.google.com/spreadsheets/d/12UJkIpS-a9qwzKtDtXzhgpywDVgF4XCyUSYDMWZnqcg/edit#gid=0
1. the first step of this project is to scrape the list of topic IDs from the HTML file in Google Drive. You should also get the number of responses from this page as well (it will come in handy for step 2)
2. For each identified topic ID, requested details vis the JSON API. for each individual post included in the API results, parse relevant fields (as defined in the spreadsheet), and also strip the text field of any HTML tags.
3. Depending on the number of responses, you may have to page through the results. It would be ideal if you looked at the number of responses during step one, divide the number by the number of results per page, and determine how much paging is required.
It is important to me that if you take the time to respond, please prove to me that you actually have reviewed and understand what I'm asking for. Essentially it boils down to having a CSV file of every single individual post from that form.
Other relevant files on Google Drive:
Folder:
[login to view URL]
JSON Layout:
[login to view URL]
Hi there, I have read the project & checked the google drive links and the files attached..
I can create this script in python to get data from the [login to view URL] API.. I have good web scraping reviews in python as well..Hope to hear from you..
$50 USD in 1 day
5.0 (64 reviews)
5.6
5.6
2 freelancers are bidding on average $53 USD for this job