I need a program that will scrape all of the product information (including all specifications of each product, any associated datasheet, protocol, manuals, and image files) for all the products on lifeome [dot] com website. Please note that all images and other files should be scraped in the form of a file, put into a zip file with the file name starting with the catalog number of the respective product (for multiple types of file of the same product, for example if there are multiple images for one product, then it should be catalog number followed by "_" and then 1, 2, 3 etc. The files should also be scraped in the form of a URL (.pdf or .png). Note this applies for all product associated files present on site.
Relations for certain attributes such as product size should be presented in separate excel file with the original catalog number in column A and related catalog number in column B. For example: If you have product ABC-100 which is 100g and product ABC-200 which is 200g then in excel file column A should be ABC-100 and column B should be ABC-200, so this will indicate that they are related (meaning same product just different size/color/shape/form, etc.)
Please note that all product information should be listed in columns within an excel file in an organized manner. Each column heading must indicate the type of information that was scraped based on the specification/information type which is noted within the site.
This program should be re-runable at anytime to updated all the product information. and again create another excel file full of updated information (as long as the site interface will not change).
A template will be provided to help guide the format of data entry.
18 freelancers are bidding on average $150 for this job
Hi sir, I am scraping expert, I have did too many similar projects, please check my feedback then you will know. Can you tell me more details? then I will provide demo data for you. Thanks, Kimi
Hi there, I can write this scraper for you. I mostly understand how you want your output to be formatted but will need clarification. Regards, Julijan