Completed

online business directory extractor - page crawler/screen scraper

Hello, I need a tool to extract all business details (name, address, telephone numbers and website) from [url removed, login to view] for marketing purposes. I'm not trying to make my own directory!

When you enter a search term into [url removed, login to view], such as "garages" in the "uk", it brings up 48,575 garages in the UK. I then want to be able to copy all of these garages' details into a spreadsheet for my marketing use.

I don't just want this done for garages. I want a working copy of the program so I can run various searches myself.

The catch is that Yell only brings up 10 pages per search, with 10 businesses on each page. So even if you search for "garages" in the "uk" and it says there are 48,575 results, you can only view 10 x 10 = 100 of them.

You're the experts, so if you know a good way of coding this to get all the search results at once, then great. But the only way round it I can think of is doing multiple searches, one for each area, as described below:

The software would have to search for "garages" in every postcode that exists, using the first 4 characters, e.g. GU51 (I can provide a list of all UK postcode areas), which would be 2,482 postcodes. It would search for the term in each postcode area, view the 10 pages of 10 entries and copy them into the spreadsheet. This would obviously create lots of duplicates, which the software would then remove. After going through all the postcodes, it would have a comprehensive list of whatever I was searching for.
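The postcode sweep with duplicate removal could be sketched roughly as below. This is only an illustration under assumptions: `fetch_page` is a hypothetical function supplied by the developer that returns the records for one results page, and treating a business as a duplicate when its normalised name and phone digits match is just one possible rule.

```python
from typing import Callable, Iterable

# One business record as listed in the brief: name, address, phone, website.
Business = dict[str, str]

# Hypothetical fetcher: (term, postcode, page number) -> records on that page.
# How pages are retrieved and parsed is deliberately left out of this sketch.
Fetcher = Callable[[str, str, int], list[Business]]

PAGES_PER_SEARCH = 10  # the brief says only 10 pages are viewable per search


def dedupe_key(business: Business) -> tuple[str, str]:
    """Assumed identity rule: lower-cased name plus the digits of the phone number."""
    name = " ".join(business.get("name", "").lower().split())
    phone = "".join(ch for ch in business.get("phone", "") if ch.isdigit())
    return (name, phone)


def sweep(term: str, postcodes: Iterable[str], fetch_page: Fetcher) -> list[Business]:
    """Search every postcode, keeping the first copy of each business seen."""
    seen: set[tuple[str, str]] = set()
    results: list[Business] = []
    for postcode in postcodes:
        for page in range(1, PAGES_PER_SEARCH + 1):
            records = fetch_page(term, postcode, page)
            if not records:
                break  # this postcode has no further pages
            for business in records:
                key = dedupe_key(business)
                if key not in seen:
                    seen.add(key)
                    results.append(business)
    return results
```

Passing the fetcher in as a parameter keeps the sweep and duplicate logic testable with canned data, separately from whatever code actually reads the pages.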

The software would obviously need to be pretty robust to do this: it should not crash all the time, and it should not take too long to run a search. E.g. if it extracted the details of 20,000 businesses within a few hours, I would be happy. But ideally faster!
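To put a rough number on "a few hours", the worst-case page count for a full sweep can be worked out from the figures above. The 3-hour target is an assumption made for this sketch, and the calculation assumes every postcode returns all 10 pages, which will often not be the case.

```python
POSTCODES = 2482         # from the brief
PAGES_PER_POSTCODE = 10  # from the brief
TARGET_HOURS = 3         # assumption: "a few hours" taken as 3


def requests_needed(postcodes: int, pages: int) -> int:
    """Worst-case number of result pages for one full sweep."""
    return postcodes * pages


def required_rate(total_requests: int, hours: float) -> float:
    """Pages per second needed to finish within the given number of hours."""
    return total_requests / (hours * 3600)


total = requests_needed(POSTCODES, PAGES_PER_POSTCODE)
rate = required_rate(total, TARGET_HOURS)
print(f"{total} page requests, {rate:.2f} per second")
# prints "24820 page requests, 2.30 per second"
```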

I would like the software to basically consist of the following:

1) A box where I can enter the term I want to search for.

2) A tick box for searching the whole of the UK, i.e. going through the postcode list and compiling all of the results, removing duplicates.

3) The ability to enter a specific place, e.g. "manchester", and get back just the first 100 results which Yell returns for that area.

4) The ability to select postcodes individually, e.g. a check box for each postcode I wish to search and extract data from, so that it goes through only those.

5) Detailed stats.
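Items 1 to 4 above could be captured in a small settings object, sketched below. The names `SearchOptions` and `locations_to_search` are made up for illustration, and the order of precedence (whole UK first, then ticked postcodes, then a named place) is an assumption, not something the list specifies.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class SearchOptions:
    term: str                    # item 1: the search term
    whole_uk: bool = False       # item 2: sweep the full postcode list
    place: Optional[str] = None  # item 3: one named place, e.g. "manchester"
    postcodes: list[str] = field(default_factory=list)  # item 4: ticked postcodes


def locations_to_search(opts: SearchOptions, all_postcodes: list[str]) -> list[str]:
    """Turn the selected options into the list of locations to search."""
    if opts.whole_uk:
        return list(all_postcodes)
    if opts.postcodes:
        return list(opts.postcodes)
    if opts.place:
        return [opts.place]
    raise ValueError("select the whole UK, some postcodes, or a place")
```

For example, `locations_to_search(SearchOptions("garages", place="manchester"), [])` gives a single-entry list containing only that place, which matches the "first 100 results" behaviour in item 3.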

Skills: Engineering, Linux, Microsoft, Project Management, Script Install, Shell Script, Software Architecture, Software Testing, UNIX, Windows Desktop


About the Employer:
( 15 reviews ) Hebburn, United Kingdom

Project ID: #3840256

Awarded to:

jhaisalvador

See private message.

$100 USD in 14 days
(65 Reviews)
5.4

13 freelancers are bidding on average $140 for this job

mhmhz

See private message.

$127.5 USD in 14 days
(165 Reviews)
6.7
radzivil

See private message.

$126.65 USD in 14 days
(88 Reviews)
6.1
shortwire

See private message.

$161.5 USD in 14 days
(98 Reviews)
5.6
coderprovw

See private message.

$170 USD in 14 days
(38 Reviews)
5.6
belovzorov

See private message.

$170 USD in 14 days
(35 Reviews)
5.3
MCATARO

See private message.

$170 USD in 14 days
(29 Reviews)
4.8
sneka

See private message.

$170 USD in 14 days
(3 Reviews)
3.9
sunny05tt

See private message.

$68 USD in 14 days
(22 Reviews)
3.8
Ehurec

See private message.

$40.8 USD in 14 days
(5 Reviews)
2.3
d2bsolutions

See private message.

$170 USD in 14 days
(10 Reviews)
2.3
serialcodervw

See private message.

$170 USD in 14 days
(0 Reviews)
0.0
erpoojasharma

See private message.

$170 USD in 14 days
(1 Review)
0.0