Closed

Web Scraping Highly Secure Business Directory of Public Information

I want the following information on Real Estate companies in Calgary, Alberta, Canada (here's an example [url removed, login to view]):

Company Name

Address

Website

Phone Number

Company Description

SIC Code

NAICS Code

Contact Name

Contact Position

Location Type

Revenue

Employees

Years in Business

This information can be selected using the following code:

item['company'] = [url removed, login to view]('//h1[@class="company-name"]/text()').extract()

item['address'] = [url removed, login to view]('//div[@itemprop="streetAddress"]/text()').extract()

item['website'] = [url removed, login to view]('//dl[@class="website_info"]/dd/span/text()').extract()

item['phone'] = [url removed, login to view]('//dd[@class="tel"]/text()').extract()

item['description'] = [url removed, login to view]('//p[@itemprop="description"]/text()').extract()

item['contact_name'] = [url removed, login to view]('//span[@itemprop="name"]/text()').extract()

item['contact_position'] = [url removed, login to view]('//em[@itemprop="jobTitle"]/text()').extract()

item['location_type'] = [url removed, login to view]('//table[@class="table-data"]/tr[1]/td/text()').extract()

item['SIC'] = [url removed, login to view]('//table[@class="table-data"]/tr[4]/td/text()').extract()

item['NAICS'] = [url removed, login to view]('//table[@class="table-data"]/tr[5]/td/text()').extract()

item['revenue'] = [url removed, login to view]('//table[@class="table-data"]/tr[2]/td/text()').extract()

item['employees'] = [url removed, login to view]('//table[@class="table-data"]/tr[3]/td/text()').extract()

item['years_business'] = [url removed, login to view]('//table[@class="table-data"]/tr[8]/td/text()').extract()

This MUST be done using Scrapy, the web crawling framework written in Python.

Your deliverable is an excel spreadsheet with the above information on each company. There are 1,355 Real Estate companies in Calgary, Canada, so I'm expecting that many rows.

For proof that you scraped the appropriate information I require a screenshot of the excel spreadsheet showing the last 20 rows. If the information on those last 20 companies matches what is found on Manta I will pay you the agreed price in exchange for the excel file.

I may have additional work for you if you complete this task successfully.

Skills: Web Scraping

See more: web scraping python 3, web scraping price, web scraping business, text em, secure corp, scraping the web, python web framework, python exchange, python canada, n.s. corp, excel spreadsheet business, directory web scraping, data scraping company, what is web crawling, what is data scraping, web scrapy, web scraping excel, scraping python, python scraping, python excel, extract phone number, excel python, crawling of data, canada business data, business p

About the Employer:
( 1 review ) Calgary, Canada

Project ID: #4513409

17 freelancers are bidding on average $156 for this job

renesoft

Hello. Please read pm.

$263 CAD in 5 days
(8 Reviews)
6.0
ehsankayani

HI, I can do this for you but why you want it done by scrapy? You will get the same info you need Thank you

$157 CAD in 4 days
(32 Reviews)
5.8
sonarkaushik

Sir, I can do the project. Refer PMB. Looking for further discussions in this matter. with thanks and regards

$142 CAD in 2 days
(11 Reviews)
4.2
ideadezigner

Please check PMB

$144 CAD in 3 days
(8 Reviews)
3.5
bob1982

I have great experience in website data extraction.

$147 CAD in 3 days
(5 Reviews)
3.2
jhliuster

I have already got all data, PM for more details, thanks.

$54 CAD in 0 days
(4 Reviews)
3.0
nuid

Sent you a detailed PM.

$333 CAD in 3 days
(1 Review)
2.6
Nevergivesup

Plz check pm

$155 CAD in 3 days
(2 Reviews)
2.5
occsceo

Over 15 years exp., please see pm.

$225 CAD in 4 days
(1 Review)
2.0
sunnyreddy1127

Hi, Please check your inbox for sample. Thanks.

$150 CAD in 3 days
(0 Reviews)
0.0
SovDyn

Hi, if I query Manta I only find 1010 results by this criteria: http://www.manta.com/world/North+America/Canada/Alberta/Calgary/?search=Real+Estate Are you running a different query?

$155 CAD in 3 days
(0 Reviews)
0.0
abolfathi

I'd done similar projects before.

$100 CAD in 3 days
(0 Reviews)
0.0
bunnybabu

Hi, Please check your PM.

$111 CAD in 4 days
(0 Reviews)
0.0
webscraper27

Hi, I have done similar project earlier. I can deliver the data in the given time frame. Please check the sample in your inbox. Thanks

$200 CAD in 3 days
(0 Reviews)
0.0
GladiatorSoft

Please check PM. HALF of the job is already done.

$111 CAD in 3 days
(0 Reviews)
0.0
mukki007

Hi I am interested very much in this project., Ready to take up immediately.

$45 CAD in 3 days
(0 Reviews)
0.0
heyram1

Please see your pm.

$144 CAD in 3 days
(0 Reviews)
0.0
clarix

Hello, I have developed many scrapers in python using Beautiful Soup, Twill and Scrapy, I have gathered in some occasions around 500,000 records. I look forward to being awarded this project and hope to continue to wo More

$172 CAD in 5 days
(0 Reviews)
0.0