Product Feature Extraction from Webpages

Status: In progress
Bids: 12
Avg Bid (USD): $358
Project Budget (USD): $250 - $750

Project Description:
Hi,

I need help with data mining product features from product web pages. The features should be mined from the specifications, detailed descriptions, and similar sections of each page. The requirements are: scrape 500 product web pages from 20 different e-commerce sites; mine the scraped pages to identify product attributes (such as product name, product category, model, size, price, weight, special features, etc.); and output the attributes as an <attribute, value> map. The algorithm should be generic enough to work on sites beyond the 20 scraped test sites.
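To make the expected <attribute, value> output concrete, here is a minimal sketch of one possible heuristic: reading two-cell table rows and <dt>/<dd> pairs from a page. It is only an illustration, not the required algorithm. It is written in Python with the standard library for brevity, although the posting lists Java. The names `SpecTableParser` and `extract_attributes` are hypothetical, and the sample HTML is made up rather than taken from the attached files. Real pages would also need attributes mined from free-text descriptions, which this does not attempt.

```python
from html.parser import HTMLParser


class SpecTableParser(HTMLParser):
    """Collects <attribute, value> pairs from two-cell table rows and <dt>/<dd> pairs.

    Hypothetical helper: it assumes the specifications are marked up as
    label/value cells, which will not hold for every e-commerce site.
    """

    CELL_TAGS = {"th", "td", "dt", "dd"}

    def __init__(self):
        super().__init__()
        self.attributes = {}  # resulting attribute -> value map
        self._cells = []      # cell texts of the row or definition being read
        self._buffer = None   # text chunks of the open cell; None outside cells

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._cells = []
        elif tag in self.CELL_TAGS:
            if tag == "dt":
                self._cells = []  # a <dt> starts a new label/value pair
            self._buffer = []

    def handle_data(self, data):
        if self._buffer is not None:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag in self.CELL_TAGS and self._buffer is not None:
            # Collapse runs of whitespace inside the cell text.
            text = " ".join("".join(self._buffer).split())
            self._cells.append(text)
            self._buffer = None
            if tag == "dd":
                self._store()
        elif tag == "tr":
            self._store()

    def _store(self):
        # Keep only clean label/value pairs: exactly two non-empty cells.
        if len(self._cells) == 2 and all(self._cells):
            key, value = self._cells
            key = key.rstrip(":").strip().lower()
            self.attributes.setdefault(key, value)  # first occurrence wins
        self._cells = []


def extract_attributes(html):
    """Return a dict mapping attribute names to values found in `html`."""
    parser = SpecTableParser()
    parser.feed(html)
    parser.close()
    return parser.attributes


# Made-up sample markup, for illustration only.
SAMPLE = """
<table>
  <tr><th>Model:</th><td>CM-100</td></tr>
  <tr><td>Weight</td><td>3.2 <b>lbs</b></td></tr>
  <tr><td colspan="2">Specifications</td></tr>
</table>
<dl><dt>Capacity</dt><dd>12 cups</dd></dl>
"""

if __name__ == "__main__":
    for name, value in extract_attributes(SAMPLE).items():
        print(f"{name} = {value}")
```

A purely structural rule like this is easy to reuse across sites, but it misses attributes that appear only in running text, so a generic solution would likely combine it with text-based extraction.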

The attached zip file contains a few sample product web pages.

Thanks for your interest,
Richard

Skills required:
Data Mining, HTML, Java, Machine Learning, Web Scraping
Additional Files: coffeemaker.zip
Project posted by:
richardxy United States
Verified