HTML Page Building - Data Mining RA-NO-FR 2013-05-25

  • Status Completed
  • Budget $30 - $250 USD
  • Total Bids 28

Project Description

Our projects consist of taking existing content of product pages (HTML) from our providers & removing unnecessary tags & coding (cleaning up the HTML) & then delivering HTML to us. Remove all CSS classes & do some typing, plus mostly copy/paste. You'll need to have an HTML editor with search/replace function like Dreamweaver.

This job contains approximately 473 line items. This will mean copy/paste & clean-up HTML from approximately 473 pages like found in the URLs below.

The list of products is provided in a CSV file. All HTML will be returned pasted in a CSV file. It is very simple. There will be a product name & you will be able to pull the item page directly from the vendor's site. When there is a supplement facts data table, use our pre-formatted table, thus making it very easy to copy/paste the data in the table.

Here is an example page from which you would capture data:

1. [url removed, login to view] (including the table data - we already have a table formatted. All you need to do is copy/paste.)

Once all data has been captured, we need all whitespace removed between HTML tag brackets. Simple removal done in Dreamweaver or other editor.

This is an ongoing opportunity of multiple projects of approximately 16,000 possible pages. These will be independent jobs paid per job. Each page takes on average 4-5 minutes. This equates to about 10 pages per hour giving some buffer.

An example page result is attached along with detailed instructions. Please provide us two work samples of any two products from the list along with your bid. We will evaluate your sample & award the job.

There is no copyright issue since we have permission to obtain this information and re-use it.

Get free quotes for a project like this
Completed by:

Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online