LinkedIn Scraping

CLOSED
Bids: 13
Avg Bid (USD): $549
Project Budget (USD): $250 - $750

Project Description:
Hello Scraping Experts:

We are looking for someone to write a tool that we can use to scrape information from LinkedIn. Specifically, we want the information, in spreadsheet format, of anyone who is a "Chief", President, EVP, SVP, VP, Director or Manager. We have the servers to run the software. We would prefer it to run on Windows rather than Linux, although we do have Linux servers.
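As a rough illustration of the title filter we have in mind, here is a minimal Python sketch. The keyword list comes from the paragraph above; the function names and the whole-word, case-insensitive matching rule are assumptions for illustration, not a fixed requirement.

```python
import re
from typing import Optional

# Keywords taken from the project description; matching them as whole
# words, case-insensitively, is an assumption made for this sketch.
SENIOR_PATTERN = re.compile(
    r"\b(chief|president|evp|svp|vp|director|manager)\b",
    re.IGNORECASE,
)


def seniority_keyword(title: Optional[str]) -> Optional[str]:
    """Return the first target keyword found in the title, lowercased, or None."""
    match = SENIOR_PATTERN.search(title or "")
    return match.group(1).lower() if match else None


def is_senior_title(title: Optional[str]) -> bool:
    """Return True if the title contains at least one target keyword."""
    return seniority_keyword(title) is not None
```

Note that abbreviations such as "CEO" or "CFO" contain none of the listed keywords, so this sketch would skip them unless they are added to the pattern.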

You should have already written scraping software in the past, with a specific focus on LinkedIn. If you have not already written a scraping tool for LinkedIn, do not waste your time or ours by contacting us. You should be able to start this project immediately. We will have a team of individuals managing the scraping on our own servers. We have close to a thousand proxy server IP addresses to assist with the scraping process.

Below is a description:

I am looking for a programmer. I need all the contacts on LinkedIn to be put in a database with the details below. We have roughly four million companies throughout the world. We can provide the URL of each of these, as well as the names of the companies.

Details to be extracted (one or more of these sections is sometimes missing from the company page):
- Name of the employee
- Title
- The name of the company
- "HQ Region" - city and country need to be extracted in separate columns
- "Industry"
- "Type"
- "Status"
- "Company Size"
- "Website"
- "Revenue"
- "Founded"
- Job Title
- Experience
- Any additional information we can capture

- A link to the LinkedIn profile of the company (please save the profile locally)
- Number of "Current Employees" registered on LinkedIn
- "Parent company" name
- "Headquarters Address": please put the full address in one column and extract only the country from that field into another

We believe there are roughly 50 million profiles that meet our needs. We need this information to be delivered in a FileMaker Pro database. The programmer needs to have already done something similar with LinkedIn and must know precisely how to do it. The programmer must be familiar with LinkedIn's limits, both in terms of search results (500, so the program will need to break down the search by country, industry and company size to be sure to capture every single company/contact) and in terms of the number of company pages that can be accessed in a given timeframe from one account / IP address.

This is a very easy task and easy money for someone who has already done something similar. I do not care which technologies are used. Any tool or programming language is fine (although I think Excel VBA is not robust enough). We have a bank of dedicated servers that can be used for this job. There are ten servers currently, but we could add another ten if necessary.

To prove that you can do it quickly, please send me sample data for 5,000 individuals with "Chief" in their current job title and US as their current country, in an Excel spreadsheet with basic columns. Send me that file together with your confirmation that you are sure you can perform the overall task. I will then choose you right away and we will move forward quickly together. You must be able to write perfect English. We would prefer to work with those who can also speak English.
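For clarity on what the sample file should contain, here is a minimal Python sketch that filters already collected rows and writes them as CSV, which Excel opens directly. The column names, the accepted spellings of the country and the function names are assumptions for illustration.

```python
import csv
import re

# "Basic columns" is not defined further in this posting; these four
# names are an assumption.
BASIC_COLUMNS = ["name", "title", "company", "country"]
US_NAMES = {"us", "usa", "united states"}
CHIEF_PATTERN = re.compile(r"\bchief\b", re.IGNORECASE)


def is_chief_in_us(row: dict) -> bool:
    """True if the title contains the word 'Chief' and the country is the US."""
    title = row.get("title") or ""
    country = (row.get("country") or "").strip().lower()
    return bool(CHIEF_PATTERN.search(title)) and country in US_NAMES


def write_sample(rows, out, limit=5000):
    """Write up to `limit` matching rows to the file-like object `out`.

    Returns the number of data rows written (the header is not counted).
    """
    writer = csv.DictWriter(out, fieldnames=BASIC_COLUMNS, extrasaction="ignore")
    writer.writeheader()
    written = 0
    for row in rows:
        if written >= limit:
            break
        if is_chief_in_us(row):
            writer.writerow(row)
            written += 1
    return written
```

To produce a file, open it with `open("sample.csv", "w", newline="")` and pass the handle as `out`; the file name is only an example.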

Thanks a lot,

The Administrator removed this message for containing contact details which breaches our Terms of Service

Skills required:
HTML, MySQL, PHP, Software Architecture, Web Scraping
About the employer:
Verified