Create a scraper / script / crawler to extract product data from an online shop - go through all products - export in csv or excel

This project was awarded to name63 for $99 USD.

Project Budget
$30 - $250 USD
Project Description

Dear freelancers,

We need an efficient web scraper that we can run on one of our own servers. WE ARE LOOKING FOR AN EXPERIENCED DEVELOPER - work must be flawless!

The following should be done:

The website to scrape/crawl is: [url removed, login to view]

--> It is an online shop with almost 80k products. The scraper should start with these main top-level categories:

[url removed, login to view]

[url removed, login to view]

[url removed, login to view]

[url removed, login to view]

[url removed, login to view]

[url removed, login to view]

[url removed, login to view]

It should then scrape EVERY SINGLE product within these categories (as noted, around 80,000 items).

The following information should be exported for EACH product - please use this URL to understand the items explained below: [url removed, login to view]
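To illustrate the traversal described above, here is a minimal sketch of walking the category pages and collecting every product URL exactly once. The function name `collect_product_urls` and the `parse_listing` callback are hypothetical: the callback stands in for whatever code downloads a listing page and returns the product links and further listing links (subcategories, next pages) found on it, since the real page structure is not given here.

```python
from collections import deque
from typing import Callable, Dict, Iterable, List, Set, Tuple

# Hypothetical callback type: given a listing URL, return
# (product URLs on that page, further listing URLs to visit).
ListingParser = Callable[[str], Tuple[Iterable[str], Iterable[str]]]


def collect_product_urls(
    start_urls: Iterable[str], parse_listing: ListingParser
) -> List[str]:
    """Breadth-first walk over listing pages, de-duplicating as it goes."""
    queue = deque(start_urls)
    seen_listings: Set[str] = set(queue)
    # A dict keeps first-seen order while dropping duplicate products,
    # e.g. a product that is listed under two categories.
    products: Dict[str, None] = {}

    while queue:
        listing_url = queue.popleft()
        product_urls, next_listings = parse_listing(listing_url)
        for product_url in product_urls:
            products.setdefault(product_url, None)
        for next_url in next_listings:
            # Never queue a listing page twice, so cyclic links terminate.
            if next_url not in seen_listings:
                seen_listings.add(next_url)
                queue.append(next_url)

    return list(products)
```

Keeping the page fetching behind a callback means the same traversal could be reused for other shops by swapping only the parser.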

1) URL (e.g. "[url removed, login to view]")

2) Breadcrumbs (e.g. "Início > Masculino > Esporte Masculino > Calçados > Tenis")

3) Brand Name - located above product name (e.g. "Puma")

4) Product Name (e.g. "Tênis Puma Axis 2 Branco")

5) Image URLs --> ALL Images in product page --> USE default resolution (not zoom image) of ~275px × 400px (e.g. "[url removed, login to view] ; [url removed, login to view] ........ etc etc")

6) Current Price (e.g. "99,90")

7) Old Price - if applicable (e.g. "199,90")

8) Installments (payable rates) - if applicable (e.g. "5 x 19,98")

9) Available sizes: (e.g. "38, 39, 40, 41, 42, 43")

10) ALL Available Data in the tab "Detalhes do produto" --> Data here is:

--> A) a short text description AND

--> B) a list with multiple different entries (NOTE: products do not always have all these entries --> compare [url removed, login to view] versus [url removed, login to view]):

--> List items could be:

- Description (plain text above actual list)

- SKU (e.g. "RA870APM16PQL")

- Modelo (e.g. "POLO RALPH LAUREN 89460PRL")

- Material (e.g. "Algodão")

- Composição (e.g. "100% Algodão")

- Cor (e.g. "Preto")

- Lavagem (e.g. "Lavar a mão")

- Medidas (e.g. "Ombro: 17cm/ Manga: 23cm/ Tórax: 116cm/ Comprimento: 76cm")

- Categoria (e.g. "Premium Masculino > Roupas > Pólos > Pólo Manga Curta")

--> That is all data we need for EACH product
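As a rough sketch of how the fields above could be normalized after extraction, the helpers below convert the price and installment strings and pad the "Detalhes do produto" entries to a fixed set of columns, so products with missing entries still produce rows of the same shape. All function names are hypothetical, and the price parser assumes Brazilian number formatting ("." for thousands, "," for decimals), which the examples suggest but do not confirm.

```python
import re
from decimal import Decimal
from typing import Dict, Optional, Tuple

# Detail entries named in the list above; a product may lack any of them.
DETAIL_KEYS = [
    "SKU", "Modelo", "Material", "Composição",
    "Cor", "Lavagem", "Medidas", "Categoria",
]


def parse_price(text: str) -> Decimal:
    """Convert a price such as "99,90" to a Decimal.

    Assumption: "." is a thousands separator and "," the decimal mark.
    """
    return Decimal(text.strip().replace(".", "").replace(",", "."))


def parse_installments(text: str) -> Optional[Tuple[int, Decimal]]:
    """Split "5 x 19,98" into (count, amount); return None if absent."""
    match = re.fullmatch(r"\s*(\d+)\s*x\s*([\d.,]+)\s*", text, re.IGNORECASE)
    if match is None:
        return None
    return int(match.group(1)), parse_price(match.group(2))


def normalize_details(raw: Dict[str, str]) -> Dict[str, str]:
    """Return every expected detail key, using "" for missing entries."""
    return {key: raw.get(key, "").strip() for key in DETAIL_KEYS}
```

Using `Decimal` rather than `float` avoids rounding surprises when prices are compared between runs.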

***NOTE*** --> We will need to run the script MULTIPLE times per week, SO: the script MUST be efficient and FAST. The data should be extracted and then saved on the server (in CSV or any other Excel-importable format). It must be possible to run the script on OUR server.

***NOTE*** --> We are looking for a long-term developer - we will not need just ONE script, BUT similar scripts for 10 different online shops. SO: we are looking for somebody who can then also develop the other scripts.
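One possible way to handle the export step, shown only as a sketch: write the rows with Python's standard `csv` module to a temporary file and rename it into place at the end, so that a run which fails halfway never overwrites the previous export with a partial file. The function name `export_csv` and the column names in the usage note are hypothetical; the `utf-8-sig` encoding is a choice made here so that Excel detects UTF-8 and shows accented characters correctly.

```python
import csv
import os
import tempfile
from typing import Dict, Iterable, List


def export_csv(
    rows: Iterable[Dict[str, str]], path: str, fieldnames: List[str]
) -> None:
    """Write rows to `path`, replacing any older export only on success."""
    directory = os.path.dirname(os.path.abspath(path))
    # The temp file lives in the target directory so the final rename
    # stays on one filesystem.
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    try:
        with os.fdopen(fd, "w", newline="", encoding="utf-8-sig") as handle:
            writer = csv.DictWriter(
                handle,
                fieldnames=fieldnames,
                restval="",             # missing fields become empty cells
                extrasaction="ignore",  # unknown fields are dropped
            )
            writer.writeheader()
            for row in rows:
                writer.writerow(row)
        os.replace(tmp_path, path)
    except BaseException:
        # Clean up the partial file and leave the old export untouched.
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise
```

A call might look like `export_csv(rows, "products.csv", ["url", "brand", "images"])`, with multiple image URLs joined into one cell using " ; " as in item 5.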

Please get in touch if you have any questions.

Thank you very much,

