Completed

Scraping HTML source code for ~7000 products

This project was successfully completed by cheapexcell for $140 USD in 3 days.

Get free quotes for a project like this
Employer working
Completed by:
Skills Required
Project Budget
$30 - $250 USD
Completed In
3 days
Total Bids
31
Project Description

I have an xls of around 7000 UPC numbers along with their associated product number. I need to search on a specific website for each product UPC and scrape a specific div section in the HTML source code for each product. The HTML text will be saved in separate .txt files corresponding to each different product. The txt files are labeled as each product number--not UPC. The copied HTML text will be added to these txt files.

The software I use has added a zero at the beginning of most UPC numbers. If the first number is a zero it needs to be removed before searching is done.

Before doing all items I would like 10 test items (or rather the .txt files for 10 items) to be sent to me so I can make sure everything is working as expected on my end.

The website I want to scrape from uses AJAX.

I do not want this done manually.

Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online