Project Description:
Hi,
I need help "data mining" product features from product web pages. The features should be mined from the specifications, detailed descriptions, etc., on each web page. The requirements are: scrape 500 product web pages from 20 different e-commerce sites; mine the scraped pages to identify product attributes (such as product name, product category, model, size, price, weight, and special features); and output the attributes as an <attribute, value> map. The algorithm should be generic enough to apply to sites beyond those 20 scraped test sites.
The attached zip files contain a few sample product web pages.
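To illustrate the kind of <attribute, value> output described above, here is a minimal sketch in Python using only the standard library. It is not a required approach, and it assumes that specifications appear as two-cell table rows or as <dt>/<dd> pairs, which will not hold for every site. The names SpecTableParser and extract_attributes, and the sample HTML, are hypothetical and used only for illustration.

```python
from html.parser import HTMLParser


class SpecTableParser(HTMLParser):
    """Collect <attribute, value> pairs from two-cell table rows and <dt>/<dd> pairs."""

    def __init__(self):
        super().__init__()
        self.attributes = {}      # the resulting <attribute, value> map
        self._cells = []          # text of the cells seen in the current <tr>
        self._buffer = None       # text pieces of the cell being read, or None
        self._pending_dt = None   # last <dt> text still waiting for its <dd>

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._cells = []
        elif tag in ("th", "td", "dt", "dd"):
            self._buffer = []

    def handle_data(self, data):
        # Only keep text that sits inside a cell we are currently reading.
        if self._buffer is not None:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag in ("th", "td", "dt", "dd") and self._buffer is not None:
            # Join the pieces and collapse runs of whitespace.
            text = " ".join("".join(self._buffer).split())
            self._buffer = None
            if tag in ("th", "td"):
                self._cells.append(text)
            elif tag == "dt":
                self._pending_dt = text
            elif self._pending_dt:  # a <dd> that follows a <dt>
                self._store(self._pending_dt, text)
                self._pending_dt = None
        elif tag == "tr":
            # Assumption: a row with exactly two cells is "name, value".
            if len(self._cells) == 2:
                self._store(*self._cells)
            self._cells = []

    def _store(self, name, value):
        name = name.rstrip(":").strip().lower()
        if name and value:
            # Keep the first value seen for each attribute name.
            self.attributes.setdefault(name, value)


def extract_attributes(html):
    """Return a dict mapping attribute names to values found in the page."""
    parser = SpecTableParser()
    parser.feed(html)
    parser.close()
    return parser.attributes


# Hypothetical sample page fragment, for illustration only.
sample = """
<table>
  <tr><th>Model:</th><td>X-200</td></tr>
  <tr><td>Weight</td><td>1.2 kg</td></tr>
  <tr><td>only one cell</td></tr>
</table>
<dl><dt>Size</dt><dd>10 x 20 cm</dd></dl>
"""
print(extract_attributes(sample))
# {'model': 'X-200', 'weight': '1.2 kg', 'size': '10 x 20 cm'}
```

A generic solution would likely need more than this, for example handling attributes that appear only in free-text descriptions, but the returned dictionary shows the output shape I am looking for.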
Thanks for your interest,
Richard