What do i want to know:
1. How many products a company sells
2. If the negative review mentions words from a list i give the software
3. Results show company, number of products, total number of reviews, total number of critical reviews, number of reviews with result positive from word list. * - If the product is sold FBA. If product is sold by [url removed, login to view]
4. Returns a csv list with 2 parts:
Name of company,number of products,total reviews of all products,number of critical reviews,number of reviews with words
etc. etc. etc. till finish
Product URL, Keyword found from list, date reviewed, link to the review, Shipping Route
*note new product creates an empty row to parse them.
I give software a list from website A.
There is an input for min. and max number of products to include companies to scrape. In this case I use 15 and blank. If blank no upper limit. If both are blank no limitations of any kind. Software has a settings tab where i can set default numbers here.
There is an input with run (blank) to (blank) and i can put in letters from # to Z. Again a settings tab to set default
-If A to A is put in it runs only on letter A
-If nothing is put in it runs only the page given.
-If # to D is put in . . it runs each page result from page A to D in alphabetical order.
There is an input for a file that holds a word list to check reviews for. Settings tab for location for default of this file
If any results have 15 or more in the parenthesis it adds this company to have its products scraped.
Input for min total reviews for a company - the number of total reviews for all products must exceed this number or the software skips that company
Project will be attached to a prior build with proxy functionality