The job requires extracting certain information from a website and putting it into an MS Excel 2003 spreadsheet.
There are up to 17 fields, but some entries will not have information for all 17. E.g. an entry might not list a fax number, in which case the fax field needs to be left blank.
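To illustrate the blank-field rule, here is a minimal Python sketch. The field names and the `to_row` helper are hypothetical placeholders (the real layout has up to 17 fields and will be given in the example spreadsheet); the only point is that a missing value becomes an empty cell and the remaining columns stay aligned.

```python
# Hypothetical field list for illustration only; the real job has up to 17 fields.
FIELDS = ["name", "phone", "fax"]

def to_row(entry):
    """Return one spreadsheet row, with "" for any field the entry lacks."""
    # `entry` is assumed to be a dict of whatever was found on the page.
    # A missing key or a None value both end up as an empty string.
    return [(entry.get(field) or "").strip() for field in FIELDS]

# An entry with no fax number still yields three cells, the last one blank.
row = to_row({"name": "Acme Ltd", "phone": " 555 0100 "})
```

Here `row` keeps the phone number in the phone column and leaves the fax column empty rather than shifting values across.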
Some of the information to be extracted needs to be split into multiple cells. E.g. the address might be listed as "123 Test Road, Testtown, Testville"; this needs to be split into 3 different address fields, with the commas removed.
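The address split could be sketched roughly as below. This is only an illustration, not a required method: `split_address` is a hypothetical helper, and how to handle addresses with fewer or more than 3 comma-separated parts is an assumption on my part (short addresses are padded with blanks, extra parts are merged into the last field).

```python
def split_address(address, parts=3):
    """Split a comma-separated address into exactly `parts` cells, commas removed."""
    # Break on commas and trim the whitespace around each piece.
    pieces = [p.strip() for p in address.split(",")]
    # Drop empty pieces caused by stray or trailing commas.
    pieces = [p for p in pieces if p]
    # Assumption: if there are too many pieces, merge the extras into the last cell.
    if len(pieces) > parts:
        pieces = pieces[:parts - 1] + [" ".join(pieces[parts - 1:])]
    # Assumption: if there are too few pieces, pad with blank cells.
    pieces += [""] * (parts - len(pieces))
    return pieces

cells = split_address("123 Test Road, Testtown, Testville")
```

With the example address above, `cells` holds the three address fields in order, one per spreadsheet cell.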
I am unsure exactly how many entries there are, but I estimate there are between 1000 and 1800.
Some will be duplicates; these can be left in the list.
I require 100% accuracy. This means that all entries must be extracted and that each field must contain the correct data. E.g. I don't want the fax number listed in the phone field.
Details of the site will be given in PMB.
An example spreadsheet layout will also be given in PMB.