Closed

Scrape content from a list of URLs

This project was awarded to hieutc for $250 USD.

Project Budget: $30 - $250 USD
Total Bids: 27
Project Description

I have a list of approximately 20,000 URLs.

For each URL in the list, I need the following data extracted:
- Screenshot of the home page
- Title
- Meta description
- Home page content (textual elements only, if possible)
- Any contact info that can be extracted and parsed (each type in its own column: email, phone, address, etc.)
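
As a rough illustration of the text fields listed above, here is a minimal Python sketch using only the standard library. It assumes the home page HTML has already been downloaded as a string. The names `HomePageParser` and `extract_fields` and the two regular expressions are hypothetical, and the patterns are deliberately simple, so they will miss some emails and phone numbers. Screenshots and postal addresses are not covered here, since they would need a browser-based tool and more involved parsing.

```python
import re
from html.parser import HTMLParser

# Simple, illustrative patterns; real-world contact data is messier than this.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
PHONE_RE = re.compile(r"\+?\d[\d\s().-]{7,}\d")


class HomePageParser(HTMLParser):
    """Collects the title, meta description, and visible text of a page."""

    SKIP = {"script", "style", "noscript"}  # tags whose text is not page content

    def __init__(self):
        super().__init__()
        self.title = ""
        self.meta_description = ""
        self.text_parts = []
        self._in_title = False
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag in self.SKIP:
            self._skip_depth += 1
        elif tag == "meta" and (attrs.get("name") or "").lower() == "description":
            self.meta_description = (attrs.get("content") or "").strip()

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._in_title:
            self.title += data
        elif not self._skip_depth and data.strip():
            self.text_parts.append(data.strip())


def extract_fields(html):
    """Return the requested text fields for one home page as a dict."""
    parser = HomePageParser()
    parser.feed(html)
    text = " ".join(parser.text_parts)
    return {
        "title": parser.title.strip(),
        "meta_description": parser.meta_description,
        "content": text,
        "emails": sorted(set(EMAIL_RE.findall(text))),
        "phones": sorted(set(m.strip() for m in PHONE_RE.findall(text))),
    }
```

Calling `extract_fields(html)` on each downloaded page yields one dictionary per URL, which can then be mapped onto the spreadsheet columns.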

I'll need this back in an Excel file, with the added data columns and the rows in the exact order of the source list. The images should be in a separate zipped folder, with the screenshot column referencing each image's ID name, for instance 00001.jpg.
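
One possible way to keep the output rows in source order and tie each row to its screenshot file is sketched below in Python. It is only an illustration under stated assumptions: it writes CSV (which Excel can open) rather than a native Excel file, the column names are hypothetical, and `results` is assumed to be a dictionary mapping each URL to its already-extracted fields.

```python
import csv
import io

# Hypothetical column layout; the posting only fixes the kinds of data, not the names.
COLUMNS = ["url", "screenshot", "title", "meta_description",
           "content", "email", "phone", "address"]


def image_id(position, width=5, ext="jpg"):
    """Return a zero-padded file name such as 00001.jpg for row 1."""
    return f"{position:0{width}d}.{ext}"


def write_rows(urls, results, out):
    """Write one row per URL, in the same order as the source list."""
    writer = csv.DictWriter(out, fieldnames=COLUMNS, extrasaction="ignore")
    writer.writeheader()
    for position, url in enumerate(urls, start=1):
        row = {"url": url, "screenshot": image_id(position)}
        # URLs with no extracted data still get a row, with empty cells.
        row.update(results.get(url, {}))
        writer.writerow(row)


# Usage: rows follow the order of `urls`, not the order of `results`.
buffer = io.StringIO()
write_rows(
    ["http://b.example", "http://a.example"],
    {"http://a.example": {"title": "A"}},
    buffer,
)
print(buffer.getvalue())
```

Deriving the image name from the row position means the screenshot column and the zipped folder stay in sync without a separate lookup table.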

Thanks
