We require two website directories to be scraped, the first contains in the region of 60,000 records, the second contains approximately 20,000. The websites display publicly available information (no passwords required) and the data is consistently laid out.
The scraped data should be placed into an excel spreadsheet. The nature of the data is predominantly contact information so we will require the company name, address, contact person and contact phone/fax/email to be populated in excel. Other info such as the company’s industry sector should also be recorded. The data must be captured in excel as per predefined fields that we will outline, for example splitting an address as unit no, building, street, city etc. Across all fields that we require, this will create up to around 30 cells per entity to be populated. We will ultimately be uploading the excel spreadsheet into SQL Server, so appreciation of preparing the data so that it is compatible with SQL Server is also important.
You should be able to produce examples of websites that you have scraped.
This is somewhat a test case project as there is plenty more web scraping project work to follow and providing that whoever is awarded this job performs it successfully, then they will be well positioned when bidding for the subsequent projects.
The job must be executed quickly and accurately, and frequent communication should be maintained throughout.
43 freelancers are bidding on average £80 for this job
Greetings sir, i am an expert freelancer. for this job and your 100% satisfaction is assured if you allow me to serve, for more info please cheek your message box for this project(Private)
Hello, can you please provide the two website names which have those 80,000 records first? I did similar scraping projects before, like: yellowpages, yelp. Thanks in advance. :)