I'm doing some market research and I want to see structured data within a CSV/Excel/JSON file on all of the properties within a specific area that match a specific search condition.
Data required from each property:
- Property title
- Property description
- Number of bedrooms
- Number of beds
- Number of reviews
- Avg Review rating (0-5)
- Number of bathrooms
- Max number of guests
- Host name
- Property Amenities
As an example, here is what a page listing would look like: [login to view URL]
Here is the list of the properties that we're interested in: [login to view URL]
The task is to create a basic Node application, that utilises Puppeteer - [login to view URL] to scan based off of a URL String Variable, navigate into each page, then use something like Cheerio - [login to view URL] to scan the HTML and retrieve the above points of data before returning to the previous page, and clicking on the next available listing to scan for data again until all listings have been completed.
The data would then be available/saved locally within the project repo in a CSV/Excel/JSON format (anyone is fine).
Skills Required: Node, Puppeteer(Or Cypress or Playwright), Cheerio (or a similar package).