Completed

Webpage scrapping, parsing, HTML

This project was successfully completed by rsdsoftsl for $89 USD in 2 days.

Get free quotes for a project like this
Employer working
Completed by:
Project Budget
$30 - $250 USD
Completed In
2 days
Total Bids
28
Project Description

I am looking for someone who can set me up with the ability to extract information about items/bags from [url removed, login to view]
remark: the feed only shows the latest ones, but each bag can be viewed via [url removed, login to view][BAGID]
[bagid] is an incremental integer starting from 1.

My requirements are:
- The script should be easy for me to run. Because I am not an expert you will need to outline what software/method I should use for this
- There should be two variables that I can define each time I run the script: the beginning bag number + the ending bag number (bags are listed under [url removed, login to view]) (the bags between this range should be extracted, e.g. "script getbags 10000 20000" will scrape all bags from ID 10000 to ID 20000
- The output files should be csv

In terms of exact outputs here are my requirements:

[The list of latest bags on [url removed, login to view] is here: [url removed, login to view]
and a sample bag is here: [url removed, login to view]]

The FIRST csv output file should contain columns with the following headings/content:
(0) Bag ID
(1) Brand + Item Category
(2) Size
(3) Price
(4) Payout
(5) Sold [yes/no may be 0/1] ('sold flag' in source code)
(6) Customer Name
(7) Location
(8) Total earned

Example for bag id 114492
BagID Brand + Item Category Size Price Payout Sold Customer name Location Total Earned
114492 Carter's Chino Short 4T 4T 4,49 0,45 yes thredUP Customer Missouri City, TX 8,79
114492 Arizona Jean Company Shorts 4T 4T 3,99 0,40 yes thredUP Customer Missouri City, TX 8,79
114492 Carter's Swimsuit 3T 3T 8,49 2,38 yes thredUP Customer Missouri City, TX 8,79
114492 ...
114492 ...
114492 ...
114492 ...
114492 ...

The SECOND csv output file should contain columns with the following headings/content:
(1) Bag ID
(2) Number of Items
(3) Customer Name
(4) Location
(5) Total earned [sum of payouts]
(6) Total price [sum of prices]

Example:
BagID Number of Items Customer Name Location Total Earned Total price
83212 14 Kimberly Dayton, OH 22,06 79,86
83213 25 thredUP Customer Glenmoore, PA 45,48 164,75
83214 nil nil nil nil nil
83215 ...

Some bag IDs may not exist, they should be listed with "nil" in the other columns in both lists.
Example: [url removed, login to view] [content], [url removed, login to view] nil, [url removed, login to view] [content]

Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online