Project Description:
This project involves scraping data from the websites below
http://www.healthgrades.com/pediatrics-directory/ny-new-york/new-york#pagenumber=1&sortby=popularity&isdirectory=value&q=Pediatrics&loc=New+York%2C+NY&prem=&f.specialty=14&f.distance.display=middle&f.distance_ftop=100&f.distance=100
and
http://www.healthgrades.com/pediatric-nursing-directory
to compile a database of contacts for a targeted marketing program in the USA. The contact details should include all available contact data that can be obtained from the websites such as "Contact Name", "Title", "Areas of Specialty / Services Provided", "Business Name", "Address", "Phone", "Fax", "e-mail", "Contacts Website", "Data Source (i.e. source website url)", and so forth. The contact details should be provided in rows with column headings in Excel format. The specific list of websites contact directories are as follows: Please check attached document for project specific details.
According to the AAP (American Academy of Pediatrics) there are over 57,000 pediatricians in the US, so i would be expecting a similar figure after the data is scrapped.