Web Crawler

  • Status Closed
  • Budget $30 - $100 USD
  • Total Bids 19

Project Description

I need a web crawler program that will crawl a site selling products (I'll provide the URL).

The data required would be the product categories, the products, the product specification and the "additional information" field.

The information should be laid out as a sequence of columns, for example:

Product type (parent category - for example, shoes) -> brand (sub category) -> model (sub category) -> model number (sub category) -> model number variants (this is the actual product with its variants - for example, sizes).

The product specification consists of up to seven bullet point attributes - each of which is one sentence long.

There is also an "additional information" field which contains up to 200 words of text.

The program should be able to pull the catalogue categories, products, product specification and additional information, and put them in a flat, delimited file so that the data appears in columns when opened in Excel, e.g. product type (parent category) in column 1, brand (sub category) in column 2, model (sub category) in column 3, etc.
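To make the requested layout concrete, here is a minimal Python sketch of how one crawled product could be flattened into a single row of a comma-delimited file that Excel can open. The field names (`product_type`, `brand`, `specs`, `additional_info` and so on), the column headings and the sample product are all assumptions made for illustration; the posting only fixes the order of the category columns and the limit of seven specification bullets.

```python
import csv
import io

MAX_SPEC_BULLETS = 7  # the posting says "up to seven" bullet point attributes

# Hypothetical column headings: category levels first, then the specification
# bullets, then the "additional information" text.
HEADER = (
    ["Product type", "Brand", "Model", "Model number", "Variant"]
    + [f"Specification {i}" for i in range(1, MAX_SPEC_BULLETS + 1)]
    + ["Additional information"]
)


def product_to_row(product):
    """Flatten one product dict into a fixed-width list of cells."""
    specs = list(product.get("specs", []))[:MAX_SPEC_BULLETS]
    # Pad with empty cells so every row has the same number of columns.
    specs += [""] * (MAX_SPEC_BULLETS - len(specs))
    return [
        product["product_type"],
        product["brand"],
        product["model"],
        product["model_number"],
        product["variant"],
        *specs,
        product.get("additional_info", ""),
    ]


def write_inventory(products, fileobj):
    """Write a header row followed by one row per product variant."""
    writer = csv.writer(fileobj)
    writer.writerow(HEADER)
    for product in products:
        writer.writerow(product_to_row(product))


# Made-up sample data, only to show the shape of the output.
sample = [
    {
        "product_type": "Shoes",
        "brand": "Acme",
        "model": "Runner",
        "model_number": "R-100",
        "variant": "Size 9",
        "specs": ["Breathable mesh upper.", "Rubber outsole."],
        "additional_info": "Care notes, with a comma, go here.",
    }
]

buffer = io.StringIO()
write_inventory(sample, buffer)
print(buffer.getvalue())
```

When writing to a real file instead of the in-memory buffer, open it with `newline=""` as the `csv` module documentation recommends. The `csv` writer quotes cells that contain commas or line breaks, so the longer "additional information" text stays in one column.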

This is a relatively simple job which would suit someone who perhaps already has a similar program they've developed which could be tweaked to fit my requirements.

The application should be able to run whenever I want and generate an inventory list. Ideally, I'd like the program to be able to crawl the site in sections.
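One possible reading of "crawl the site in sections" is to restrict each run to URLs under a chosen category path. The Python sketch below illustrates that idea under stated assumptions: the `shop.example` URLs and the in-memory `FAKE_SITE` are invented, and page fetching is passed in as a function so the example runs without network access. A real run would supply a function that downloads each page.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl_section(section_url, fetch, max_pages=100):
    """Breadth-first crawl limited to URLs that start with section_url.

    fetch(url) must return the page HTML as a string, or None on failure.
    """
    seen = {section_url}
    queue = deque([section_url])
    visited = []
    while queue and len(visited) < max_pages:
        url = queue.popleft()
        html = fetch(url)
        if html is None:
            continue
        visited.append(url)
        parser = LinkCollector()
        parser.feed(html)
        for href in parser.links:
            # Resolve relative links and drop any #fragment part.
            absolute, _fragment = urldefrag(urljoin(url, href))
            if absolute.startswith(section_url) and absolute not in seen:
                seen.add(absolute)
                queue.append(absolute)
    return visited


# Invented stand-in for the real site, used only for this demonstration.
FAKE_SITE = {
    "https://shop.example/shoes/":
        '<a href="/shoes/acme/">Acme</a> <a href="/hats/">Hats</a>',
    "https://shop.example/shoes/acme/":
        '<a href="runner-100.html">Runner 100</a> <a href="/shoes/">Back</a>',
    "https://shop.example/shoes/acme/runner-100.html": "<h1>Runner 100</h1>",
    "https://shop.example/hats/": "<h1>Hats</h1>",
}

pages = crawl_section("https://shop.example/shoes/", FAKE_SITE.get)
print(pages)
```

Here the crawl starts at the hypothetical shoes category and never follows the link to the hats category, so each section can be crawled in a separate run.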

I'd like the program as soon as possible, please.
