In Progress

Web Scraping a Business Directory - repost

I've tried scraping the following website but could not parse the data: [url removed, login to view]

I want the following information on Real Estate companies in Calgary, Alberta, Canada (here's an example [url removed, login to view]):

Company Name

Address

Website

Phone Number

Company Description

SIC Code

NAICS Code

Contact Name

Contact Position

Location Type

Revenue

Employees

Years in Business

This information can be selected using the following code:

item['company'] = [url removed, login to view]('//h1[@class="company-name"]/text()').extract()

item['address'] = [url removed, login to view]('//div[@itemprop="streetAddress"]/text()').extract()

item['website'] = [url removed, login to view]('//dl[@class="website_info"]/dd/span/text()').extract()

item['phone'] = [url removed, login to view]('//dd[@class="tel"]/text()').extract()

item['description'] = [url removed, login to view]('//p[@itemprop="description"]/text()').extract()

item['contact_name'] = [url removed, login to view]('//span[@itemprop="name"]/text()').extract()

item['contact_position'] = [url removed, login to view]('//em[@itemprop="jobTitle"]/text()').extract()

item['location_type'] = [url removed, login to view]('//table[@class="table-data"]/tr[1]/td/text()').extract()

item['SIC'] = [url removed, login to view]('//table[@class="table-data"]/tr[4]/td/text()').extract()

item['NAICS'] = [url removed, login to view]('//table[@class="table-data"]/tr[5]/td/text()').extract()

item['revenue'] = [url removed, login to view]('//table[@class="table-data"]/tr[2]/td/text()').extract()

item['employees'] = [url removed, login to view]('//table[@class="table-data"]/tr[3]/td/text()').extract()

item['years_business'] = [url removed, login to view]('//table[@class="table-data"]/tr[8]/td/text()').extract()
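Each `.extract()` call above returns a list of strings, while a spreadsheet needs one value per cell. The sketch below shows one way the lists might be collapsed into a flat row. The selector object is redacted in the post, so nothing here depends on it. `FIELDS` copies the item keys from the code above; the helper `flatten_item` is a hypothetical name and not part of Scrapy.

```python
# Field names copied from the item keys in the post.
FIELDS = [
    "company", "address", "website", "phone", "description",
    "contact_name", "contact_position", "location_type",
    "SIC", "NAICS", "revenue", "employees", "years_business",
]


def flatten_item(item):
    """Collapse each list returned by .extract() into one clean string.

    Hypothetical helper for illustration; missing fields become "".
    """
    row = {}
    for field in FIELDS:
        values = item.get(field) or []
        # Strip whitespace from every fragment and drop the empty ones.
        parts = [v.strip() for v in values if v and v.strip()]
        # Join the fragments so multi-node matches are not silently lost.
        row[field] = " ".join(parts)
    return row


# Usage with made-up sample data (not taken from the real site).
sample = {"company": ["  Example Realty Ltd. \n"], "phone": []}
print(flatten_item(sample)["company"])
```

Note that the positional `tr[n]` selectors assume every company page lists its table rows in the same order, so a page with a missing row would shift values into the wrong column. Spot-checking a few flattened rows against the live pages would catch that.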

This MUST be done using Scrapy, the web crawling framework written in Python.

Your deliverable is an Excel spreadsheet with the above information on each company. There are 1,355 Real Estate companies in Calgary, Canada, so I'm expecting one row per company, 1,355 rows in total.
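As a rough sketch of the deliverable, the rows could be written to a CSV file that Excel opens directly, then checked against the expected count of 1,355. This swaps CSV in for a native Excel file, which is an assumption about what is acceptable. The names `write_rows` and `check_row_count` are hypothetical. Scrapy's own feed export (`scrapy crawl <spider> -o file.csv`) may make the writing step unnecessary.

```python
import csv

EXPECTED_ROWS = 1355  # company count stated in the post


def write_rows(rows, path, fieldnames):
    """Write a list of dicts to a CSV file and return the number of rows."""
    # utf-8-sig adds a byte order mark so Excel detects the encoding.
    with open(path, "w", newline="", encoding="utf-8-sig") as handle:
        writer = csv.DictWriter(
            handle, fieldnames=fieldnames, extrasaction="ignore"
        )
        writer.writeheader()
        # Keys missing from a row are written as empty cells.
        writer.writerows(rows)
    return len(rows)


def check_row_count(count, expected=EXPECTED_ROWS):
    """Return True when the scraped row count matches the expected total."""
    return count == expected
```

A count below 1,355 would suggest that some listing pages were skipped or blocked. A count above it would suggest duplicate rows.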

For proof that you scraped the appropriate information, I require a screenshot of the Excel spreadsheet showing the last 20 rows. If the information on those last 20 companies matches what is found on Manta, I will pay you the agreed price in exchange for the Excel file.
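The last-20-rows check can be sketched as follows, assuming the scraped rows have been saved as a CSV file with a header row (an assumption, since the post asks for an Excel spreadsheet). `last_rows` is a hypothetical helper name.

```python
import csv
from collections import deque


def last_rows(path, n=20):
    """Return the final n data rows of a CSV file as a list of dicts."""
    with open(path, newline="", encoding="utf-8-sig") as handle:
        reader = csv.DictReader(handle)
        # A bounded deque keeps only the most recent n rows in memory.
        return list(deque(reader, maxlen=n))
```

Printing these rows next to the matching company pages gives a quick manual comparison before the screenshot is taken.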

I may have additional work for you if you complete this task successfully.

Skills: Data Mining, Python, Web Scraping

About the Employer:
( 1 review ) Calgary, Canada

Project ID: #4475755

Awarded to:

ashok7925

Hi, Please check your PM.

$105 CAD in 2 days
(12 Reviews)
4.7

17 freelancers are bidding on average $139 for this job

SigmaVisual

I can help in your project, please check PMB and our ratings/reviews to get idea of our experience. Please let me know if you have any queries.

$231 CAD in 5 days
(36 Reviews)
6.5
mantislin

Hi, I am a scraping expert. More details in PM.

$220 CAD in 5 days
(70 Reviews)
6.2
phpXpertbd

Seasoned web scraper. I worked on many similar projects, I have big experience in data mining projects. I can finish this task in short time, with the best quality.

$147 CAD in 3 days
(18 Reviews)
5.9
diamond247

Expert here, with a highly skilled team and expert operators. Please see our details; I am sure we will meet your requirements. Ready to start now.

$63 CAD in 3 days
(29 Reviews)
5.6
wildlily980

It's done. I'll send you example data.

$135 CAD in 3 days
(12 Reviews)
5.6
fancywwh

Manta expert ready to start, thank

$55 CAD in 1 day
(27 Reviews)
5.2
uumairkhalid

Hi.. Expert web scraper here. Interested in your project. I assure you 100% accurate and good quality work. Ready to start. have a look at PM. Regards

$154 CAD in 3 days
(22 Reviews)
5.0
BrothersTeam

Hi I am able to start the job right now. Thanks

$154 CAD in 7 days
(21 Reviews)
4.3
ZedASolutions

expert manta scraper at your service. plz contact immediately to get this done

$110 CAD in 3 days
(13 Reviews)
3.3
mobeenraheem

Ready For Your Project.

$220 CAD in 7 days
(5 Reviews)
3.3
phpmysqlrocks

Ready to start now. Thanks

$136 CAD in 3 days
(1 Review)
3.1
Shohagmax

Ready to start. Please check PMB. Thanks.

$154 CAD in 7 days
(6 Reviews)
2.6
snafoo

Hello! I'm a recent CompSci graduate from New Zealand. I have a lot of experience using Python, including for these kinds of projects (little scripts to automate mundane tasks). I believe I can complete this for you quickly

$110 CAD in 3 days
(1 Review)
1.7
geekbrainstorm

Hi, Please see PM.

$88 CAD in 3 days
(1 Review)
0.8
willbryant

Hello, this seems simple enough. I can definitely get it done for you.

$220 CAD in 3 days
(0 Reviews)
0.0
andrewscode

Expert scraper here. You can have the data from Manta tomorrow. Let's do this quick. Thanks, Andrew

$110 CAD in 2 days
(0 Reviews)
0.0
mxdesigns

just wrote a project like this, sent you a PM with the link

$109 CAD in 0 days
(0 Reviews)
0.0
mjabdoli62

Hello. I am interested in your project. I have experience in scraping. Please check my PM.

$94 CAD in 3 days
(0 Reviews)
0.0