Website Spidering/Scraping Bot Creation Needed

  • Status Closed
  • Budget N/A
  • Total Bids 20

Project Description

EXPERIENCE WEB SCRAPING TEAM NEEDED - We are seeking someone, or a team of people, to assist our efforts to scrape all of the information that is in the public domain from a number of different websites. These include [url removed, login to view], [url removed, login to view], [url removed, login to view], [url removed, login to view] and site such as these. We want to have the business card information such as first name, last name, job title, company, address, city, state, zip, phone, fax, email, website, etc. We do not need their employment history. This job is really a long-term project - where we could ultimately use somebody full time if they were to do a good job. You would need to be able to speak english.

There are roughly a hundred million contacts we need to capture. You should have experience with MYSQL so you could import the data there and then dedupe it. We also need spiders to capture the email addresses for same. If you know how to use Web Data Extractor that could help us too. We use WDE to scrape websites for their email addresses and then determine the format for the entire site.

You should have experience doing this already - for we have already spoken to people who have been doing this before. We need somebody to help out with this immediately - so please get in touch with us soon. You can check out our website at [url removed, login to view] or through

If you have data from previous jobs and the data is less than a year old, we would also be interested in picking up all you have. You should have the knowledge to also import the data into mysql and then run a deduping software system to ensure that all of the contacts are unique.

Get free quotes for a project like this
Skills Required

Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online