In Progress

Database from web scraping of Google results

This job requires an automated method to create a database of pages, the text on those pages, and the links from those pages together with the text of the linked pages.

What I am trying to do is build a database of pages that will enable me to figure out

1) Which cities, towns, schools and other public institutions in the US and Canada have an emergency notification system

and

2) Which vendor they are using.

Because the text this will find is not consistently formatted, we've come up with this method, but would be interested in any suggestions you have for improving it.

1) Run a Google search on these search terms: register OR registration "emergency notification"

2) Identify the URLs of the found pages

For each URL:

3) Copy all of the text on the page

4) Copy all of the source of the page (separate from text of page)

5) If there is a link on the page with any of the text below in its address, follow that link and store its address as "Linked URL"

6) Copy the text on that Linked URL page and store it as "Linked URL Text"

7) If any of the links listed below appears in the first page's source, the first page's text, or the Linked URL page, enter the corresponding brand (noted below in parentheses)

8) If a US state appears in the text, put it in the State column or field
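As a rough illustration of steps 3 to 5, here is a minimal Python sketch that pulls the visible text and the link addresses out of a page's source using only the standard library. The names `PageExtractor` and `extract_page` are our own placeholders, and the sketch assumes the page source has already been downloaded as a string; it is one possible starting point, not a required design.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class PageExtractor(HTMLParser):
    """Collects visible text and absolute link addresses from one HTML page."""

    # Content of these tags is not visible text, so it is skipped.
    SKIPPED_TAGS = {"script", "style", "noscript"}

    def __init__(self, base_url):
        super().__init__(convert_charrefs=True)
        self.base_url = base_url
        self.text_parts = []
        self.links = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links against the address of the page.
                self.links.append(urljoin(self.base_url, href))

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0 and data.strip():
            self.text_parts.append(data.strip())


def extract_page(source, base_url):
    """Return (page_text, links) for the given HTML source."""
    parser = PageExtractor(base_url)
    parser.feed(source)
    parser.close()
    return " ".join(parser.text_parts), parser.links
```

The page source itself (step 4) would be stored unchanged next to the extracted text, so that links hidden in scripts or attributes can still be searched later.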

The collected data needs to be stored in an Excel spreadsheet or another format we agree on.
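One format that Excel opens directly is CSV. The sketch below shows how one row per found page could be written with Python's `csv` module. The column names "Linked URL", "Linked URL Text" and "State" come from the steps above; the remaining column names and the function name `write_records` are assumptions made for illustration.

```python
import csv
import io

# "Linked URL", "Linked URL Text" and "State" are named in the steps above;
# the other column names are assumed for this sketch.
FIELDS = ["URL", "Page Text", "Page Source", "Linked URL",
          "Linked URL Text", "Brand", "State"]


def write_records(records, out):
    """Write one CSV row per record; missing fields are left empty."""
    writer = csv.DictWriter(out, fieldnames=FIELDS)
    writer.writeheader()
    for record in records:
        writer.writerow({field: record.get(field, "") for field in FIELDS})


# Usage: write to an in-memory buffer (a real run would open a file
# with newline="" so embedded line breaks in page text survive).
buffer = io.StringIO()
write_records([{"URL": "https://town.example/", "State": "NC"}], buffer)
```

Page text and page source can be long and contain commas and line breaks; the `csv` module quotes such fields, but Excel limits how much text one cell can hold, so very large pages may need a different storage format.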

Links to registration pages will have this text in them. Each line represents one link:

[url removed, login to view] (brand is CityWatch)

[url removed, login to view] (brand is FirstCallNetwork)

[url removed, login to view] (brand is CodeRed)

[url removed, login to view] (brand is Everbridge)

[url removed, login to view] (brand is TwentyFirst Century)

[url removed, login to view] (brand is Rave)

[url removed, login to view] (brand is Deltalert)

[url removed, login to view] (brand is OneCallNow)

[url removed, login to view] (brand is RapidNotify)

[url removed, login to view] (brand is Nixle)

[url removed, login to view] (brand is Swift911)

[url removed, login to view] (brand is Cassidian) - this one will be in the form of [url removed, login to view], where XXXXXX is the name of their client, as in madisoncounty.onthealtert.com.
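Step 7 amounts to a case-insensitive substring search of the stored text for each vendor's link text. A minimal Python sketch follows. Because the actual link texts are not shown in this listing, the `.example` patterns are hypothetical placeholders that would have to be replaced with the real ones; the Cassidian pattern is copied as spelled in the example above and should be verified. The function name `find_brand` is also our own.

```python
# Link text -> brand. The ".example" entries are hypothetical placeholders,
# NOT the real vendor addresses; the last entry is copied verbatim from the
# Cassidian example in this posting and should be checked before use.
BRAND_PATTERNS = {
    "citywatch.example": "CityWatch",
    "codered.example": "CodeRed",
    "everbridge.example": "Everbridge",
    "onthealtert.com": "Cassidian",
}


def find_brand(*texts):
    """Return the brand of the first pattern found in any text, else ""."""
    for text in texts:
        lowered = text.lower()
        for pattern, brand in BRAND_PATTERNS.items():
            if pattern in lowered:
                return brand
    return ""


# Usage: pass the page source, page text and Linked URL in priority order.
brand = find_brand("<a href='https://madisoncounty.onthealtert.com/'>Sign up</a>")
```

Since the Cassidian pattern is matched as a substring, it also covers client-specific addresses of the XXXXXX form described above.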

US State: Abbreviation:

Alabama AL

Alaska AK

Arizona AZ

Arkansas AR

California CA

Colorado CO

Connecticut CT

Delaware DE

Florida FL

Georgia GA

Hawaii HI

Idaho ID

Illinois IL

Indiana IN

Iowa IA

Kansas KS

Kentucky KY

Louisiana LA

Maine ME

Maryland MD

Massachusetts MA

Michigan MI

Minnesota MN

Mississippi MS

Missouri MO

Montana MT

Nebraska NE

Nevada NV

New Hampshire NH

New Jersey NJ

New Mexico NM

New York NY

North Carolina NC

North Dakota ND

Ohio OH

Oklahoma OK

Oregon OR

Pennsylvania PA

Rhode Island RI

South Carolina SC

South Dakota SD

Tennessee TN

Texas TX

Utah UT

Vermont VT

Virginia VA

Washington WA

West Virginia WV

Wisconsin WI

Wyoming WY
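For step 8, one possible approach is sketched below in Python. It looks for a full state name first and falls back to an abbreviation only when it appears in an address-like context (a comma, the abbreviation, then a five-digit ZIP code), because abbreviations such as OR, IN, ME and OK are also ordinary English words. The ZIP-code rule and the function name `detect_state` are assumptions of this sketch, not part of the agreed method.

```python
import re

# Full name -> abbreviation, copied from the table above.
STATES = {
    "Alabama": "AL", "Alaska": "AK", "Arizona": "AZ", "Arkansas": "AR", "California": "CA",
    "Colorado": "CO", "Connecticut": "CT", "Delaware": "DE", "Florida": "FL", "Georgia": "GA",
    "Hawaii": "HI", "Idaho": "ID", "Illinois": "IL", "Indiana": "IN", "Iowa": "IA",
    "Kansas": "KS", "Kentucky": "KY", "Louisiana": "LA", "Maine": "ME", "Maryland": "MD",
    "Massachusetts": "MA", "Michigan": "MI", "Minnesota": "MN", "Mississippi": "MS", "Missouri": "MO",
    "Montana": "MT", "Nebraska": "NE", "Nevada": "NV", "New Hampshire": "NH", "New Jersey": "NJ",
    "New Mexico": "NM", "New York": "NY", "North Carolina": "NC", "North Dakota": "ND", "Ohio": "OH",
    "Oklahoma": "OK", "Oregon": "OR", "Pennsylvania": "PA", "Rhode Island": "RI", "South Carolina": "SC",
    "South Dakota": "SD", "Tennessee": "TN", "Texas": "TX", "Utah": "UT", "Vermont": "VT",
    "Virginia": "VA", "Washington": "WA", "West Virginia": "WV", "Wisconsin": "WI", "Wyoming": "WY",
}
ABBREVIATIONS = set(STATES.values())

# Whole-word match on any full state name, ignoring case.
NAME_RE = re.compile(r"\b(" + "|".join(re.escape(n) for n in STATES) + r")\b", re.IGNORECASE)
# Two capital letters after a comma and before a five-digit ZIP code.
ABBR_RE = re.compile(r",\s*([A-Z]{2})\s+\d{5}\b")


def detect_state(text):
    """Return the abbreviation of the first state found in text, else ""."""
    match = NAME_RE.search(text)
    if match:
        return STATES[match.group(1).title()]
    for match in ABBR_RE.finditer(text):
        if match.group(1) in ABBREVIATIONS:
            return match.group(1)
    return ""
```

This returns only the first state it finds, and a name such as "Washington County, Oregon" would be recorded as WA, so rows where more than one state name appears may still need a manual check.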

I have attached a spreadsheet filled out by hand to show what we want. The Word document shows the starting point (the Google search) and also the final result we are trying to achieve, although that final result is not part of this project.

Skills: Data Entry, Excel, Web Scraping, Web Search


About the Employer:
( 84 reviews ) Charlotte, United States

Project ID: #4422003

Awarded to:

Evs1

Hello, Greetings From Evirtual Services!! We understand your value of your time and believe in excellent service with 100% satisfaction. I am Jason and I represent Evirtual Services fast growing US based compa

$275 USD in 3 days
(1 Review)
1.5

12 freelancers are bidding on average $315 for this job

diamond247

Expert here, highly skilled team with expert operator, please see our details am sure it will touch your requirement, ready to start now.

$257 USD in 4 days
(84 Reviews)
6.6
Kamalkishover

We have done similar project, Pls Check PM

$275 USD in 3 days
(220 Reviews)
6.6
SigmaVisual

I can help in your project, please check PMB and our ratings/reviews to get idea of our experience. Please let me know if you have any queries.

$250 USD in 5 days
(50 Reviews)
6.5
sonarkaushik

Sir, I can do the project. Refer PMB. Looking for further discussions in this matter. with thanks and regards

$250 USD in 7 days
(65 Reviews)
5.7
buraqtech

Check your PMB for details!!!

$499 USD in 10 days
(2 Reviews)
5.2
pandey2008

plz view [url removed, login to view]

$250 USD in 6 days
(72 Reviews)
5.2
shanki161

Hello Sir>>>>>>Genuine and Reliable<<<<<<<< I am ready to [url removed, login to view] read all the [url removed, login to view] have a team of 6 [url removed, login to view] guarantee you we will deliver under the deadline. Waiting for your positive response. Thanks More

$385 USD in 3 days
(17 Reviews)
4.9
abupabuya

hi sir I just want sale my items if you interested.. .May be you need a mailing list to use in your business i have 500,000 USA EMAIL Business list- all have email,

$264 USD in 1 day
(10 Reviews)
4.0
fhasanbd

I can do this for you.....thanks

$500 USD in 12 days
(13 Reviews)
3.8
drwizsl

Ready to start

$250 USD in 3 days
(5 Reviews)
2.8
dewsoft1

Dear Sir, we are pleased to inform you that we have studied all the requirements and can deliver the same to u . we have similar work experience and can handle this quite well We develop and implement elegant &

$467 USD in 12 days
(0 Reviews)
0.0
bik1383

I would like to go for this project.

$320 USD in 6 days
(0 Reviews)
0.0