Data merging and cleaning of about 20,000 records from 8 different Excel files

CLOSED
Bids
70
Avg Bid (USD)
$117
Project Budget (USD)
$30 - $250

Project Description:
I have about 20,000 company records from 8-10 different sources in Excel tables. The tables contain a number of different fields, and some tables have more data than others. Most of the tables include at least:
:: BUSINESS NAME
:: ADDRESS
:: CITY, STATE, ZIP
:: PHONE
:: FAX
:: EMAIL
:: WEBSITE
:: Contact Name
:: SOURCE
:: Additional Fields could include: Keyword (specific to source), Primary NAIC code, Rating (Yelp, Yellowpages, Houzz), Trade Associations, Project Size, Year Established, Category, License(s), Geographic area served, Map ID (URL) and others...

I would like bids for the following data merging and cleaning deliverables:
:: combine files into one master file encompassing all of the fields available across the files (save this file separately)
:: merge any duplicate records, appending any additional fields so that each company record is complete
:: remove any duplicate records that couldn't be merged to or appended to the other existing records (save this as a separate worksheet)
:: sort by state and provide me with counts by state (please enter summary in separate worksheet)
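
The four deliverables above can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not a prescribed method. It assumes each Excel file has already been read into a list of dictionaries, that STATE and ZIP are separate columns, and that two rows are duplicates when their normalized business name and 5-digit ZIP agree. The names match_key, combine, merge_duplicates and counts_by_state are hypothetical helpers invented for this example.

```python
import re
from collections import Counter

def match_key(row):
    """Assumed duplicate rule: normalized business name plus 5-digit ZIP."""
    name = re.sub(r"[^a-z0-9]", "", row.get("BUSINESS NAME", "").lower())
    zip5 = re.sub(r"\D", "", row.get("ZIP", ""))[:5]
    return (name, zip5)

def combine(sources):
    """Stack rows from every source and pad each row to the union of all fields."""
    fields, rows = [], []
    for source_name, records in sources.items():
        for record in records:
            row = dict(record)
            row.setdefault("SOURCE", source_name)  # keep an existing SOURCE value
            for field in row:
                if field not in fields:
                    fields.append(field)
            rows.append(row)
    return fields, [{f: row.get(f, "") for f in fields} for row in rows]

def merge_duplicates(master):
    """Fold rows that share a match key into one row, filling blank fields only."""
    merged = {}
    for index, row in enumerate(master):
        key = match_key(row)
        if not key[0]:
            key = ("__no_name__", index)  # never merge rows that lack a name
        kept = merged.get(key)
        if kept is None:
            merged[key] = dict(row)
            continue
        for field, value in row.items():
            if field == "SOURCE":
                seen = kept.get(field, "").split("; ")
                if value and value not in seen:
                    kept[field] = "; ".join(filter(None, [kept.get(field, ""), value]))
            elif value and not kept.get(field):
                kept[field] = value  # the first non-blank value wins
    return list(merged.values())

def counts_by_state(records):
    """Count records per state for the summary worksheet."""
    return Counter(r.get("STATE", "") for r in records)

# Made-up sample rows, for illustration only.
sources = {
    "yelp": [{"BUSINESS NAME": "Acme Roofing, Inc.", "STATE": "TX",
              "ZIP": "75001", "PHONE": "", "Rating": "4.5"}],
    "houzz": [{"BUSINESS NAME": "ACME ROOFING INC", "STATE": "TX",
               "ZIP": "75001-1234", "PHONE": "555-0100"},
              {"BUSINESS NAME": "Birch Builders", "STATE": "OK",
               "ZIP": "73008", "PHONE": "555-0111"}],
}
fields, master = combine(sources)
merged = merge_duplicates(master)
state_counts = counts_by_state(merged)
```

In this sketch, conflicting non-blank values are not reconciled: the first value seen is kept. A real run would probably log such conflicts for manual review, since the posting stresses accuracy.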

Accuracy and attention to detail are key for this project. We are looking for someone with SQL or high-level Excel skills to complete this task using scripts or cleaning tools. After completing as much automated merging and cleaning as possible, please manually review a sample of the records to ensure accuracy. We will check the final records for accuracy and to ensure that no companies have been deleted....
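
The two checks described above (a manual review sample and a confirmation that no company was dropped) could be supported by a small script. The following is a sketch under assumptions: rows are dictionaries with a "BUSINESS NAME" field, and two names count as the same company when they match after lowercasing and stripping punctuation. The function names name_key, missing_companies and review_sample are hypothetical.

```python
import random
import re

def name_key(name):
    """Lowercase a business name and strip everything but letters and digits."""
    return re.sub(r"[^a-z0-9]", "", name.lower())

def missing_companies(original_rows, final_rows, field="BUSINESS NAME"):
    """Return original names whose normalized form is absent from the final file."""
    kept = {name_key(row[field]) for row in final_rows}
    return sorted({row[field] for row in original_rows
                   if name_key(row[field]) not in kept})

def review_sample(rows, size, seed=0):
    """Pick a reproducible random sample of rows for manual checking."""
    rng = random.Random(seed)
    return rng.sample(rows, min(size, len(rows)))
```

An empty list from missing_companies suggests that every original company still appears in the final file under this matching rule. It does not prove the merged fields are correct, which is what the manual sample is for.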

Looking forward to working with you. We will select a bidder early next week (May 6, 2013) and hope the project can be completed in a few days.

Additional Project Description:
05/02/2013 at 22:20 IST
Two additional notes:
:: Please provide the final files in Excel (.xls or .xlsx) format
:: I will only forward sample files to those shortlisted. I'll have a sample Excel file with all of the fields I'd like to see in the master record, but you might find more when working through the files

I am still collecting some records this week, so this project won't begin until next week.


05/04/2013 at 1:07 IST
We will have additional data to clean and merge with the master file in the future, so any suggestions that help automate this task going forward would be appreciated.
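
One possible way to automate future merges is to keep the master file in a small SQL database and load each new batch with an upsert. The sketch below uses Python's built-in sqlite3 module and SQLite's INSERT ... ON CONFLICT DO UPDATE clause (available in SQLite 3.24 and later). The table layout, the column names, and the match key (normalized name plus 5-digit ZIP) are all assumptions made for illustration, and only a few of the fields are shown.

```python
import re
import sqlite3

def match_key(name, zip_code):
    """Assumed duplicate rule: normalized name plus 5-digit ZIP."""
    name_part = re.sub(r"[^a-z0-9]", "", name.lower())
    return name_part + "|" + re.sub(r"\D", "", zip_code)[:5]

def open_master(path=":memory:"):
    """Open (or create) the master table; the column set is hypothetical."""
    conn = sqlite3.connect(path)
    conn.execute(
        """CREATE TABLE IF NOT EXISTS master (
               match_key TEXT PRIMARY KEY,
               name TEXT, zip TEXT, phone TEXT, email TEXT)"""
    )
    return conn

# New companies are inserted; existing ones only have blank fields filled in.
UPSERT = """
INSERT INTO master (match_key, name, zip, phone, email)
VALUES (?, ?, ?, ?, ?)
ON CONFLICT(match_key) DO UPDATE SET
    phone = CASE WHEN master.phone = '' THEN excluded.phone ELSE master.phone END,
    email = CASE WHEN master.email = '' THEN excluded.email ELSE master.email END
"""

def load_batch(conn, rows):
    """Merge a batch of row dictionaries into the master; return the row count."""
    for row in rows:
        conn.execute(UPSERT, (
            match_key(row["name"], row["zip"]),
            row["name"], row["zip"],
            row.get("phone", ""), row.get("email", ""),
        ))
    conn.commit()
    return conn.execute("SELECT COUNT(*) FROM master").fetchone()[0]
```

With this layout, each later delivery would only need to be exported to rows and passed to load_batch, and the final table could then be exported back to Excel.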

Thanks for bidding. I will award this project on Monday.

Skills required:
Big Data, Data Processing, Database Administration, Excel, SQL
About the employer:
Verified


greggfletcher: $262 in 5 days
pishty: $100 in 3 days
$79 in 4 days
$99 in 10 days
$84 in 3 days
$99 in 3 days
$110 in 3 days
$275 in 3 days
$30 in 7 days
innovese: $165 in 3 days