Closed

Parse HTML and INSERT into mySQL

This project was awarded to vintcn for $150 USD.

Get free quotes for a project like this
Employer working
Awarded to:
Skills Required
Project Budget
$100 - $500 USD
Total Bids
8
Project Description

Hello,

We sell digital cameras and needs to populate products specs in a quick manner. I am looking for a small script that will extract data from www.shopping.com. Before I go in to details, I do have permission to spider thier site and to use the data since we buy their advertising.

Here is an example page:
[url removed, login to view]

Notice the row in gray, the main Spec Category. The following Sub categories are below. What I need is the data to the right of the sub categories. The HTML is fluent through all digital camera specs. So it should be easy to write the parser to find where to parse and collect and when not to.

So all cameras within:
[url removed, login to view]

need to be parsed. If I am correct, I do beleive they use rdd or xml. Notice the links have xPP and xPF. I guess the parser could check every possible link that has the word digital AND camera then decide to parse the page.

So what needs to be added to the db? The title of the name of the camera - can be collected from header tags TITLE, but only the camera name like FUJI FINEPIX S5000 DIGITAL CAMERA. From there, the specs to be collected then INSERTED into a mysql db which will have the same subcategories name. On the very last row of the specs is a sub category of Product ID, this is not needed.

I need this done ASAP. I suspect it should take 3-4 hours to do this job. I am willing to pay $35/hour or a max of $140.

If you have any questions please reply back. Thanks

Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online