I have about 20,000 company records from 8-10 different sources in Excel tables. The tables have different sets of fields, and some contain more data than others. Most of the tables have at least the following fields:
:: BUSINESS NAME
:: CITY, STATE, ZIP
:: Contact Name
:: Additional Fields could include: Keyword (specific to source), Primary NAIC code, Rating (Yelp, Yellowpages, Houzz), Trade Associations, Project Size, Year Established, Category, License(s), Geographic area served, Map ID (URL) and others...
I would like bids for the following data merging and cleaning deliverables:
:: combine files into one master file encompassing all of the fields available across the files (save this file separately)
:: merge any duplicate records, appending any additional fields so that each company record is complete
:: remove any duplicate records that couldn't be merged to or appended to the other existing records (save this as a separate worksheet)
:: sort by state and provide me with counts by state (please enter summary in separate worksheet)
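For bidders, here is a minimal sketch of how the merge, append and count steps above might be scripted. It is an illustration under stated assumptions, not a required method: it assumes each table has been exported to rows with CITY, STATE and ZIP as separate columns, and it treats two rows as the same company when the normalized business name and 5-digit ZIP match. That matching rule, the function names and the sample rows are all hypothetical.

```python
import re
from collections import Counter

def normalize_key(record):
    """Build a match key from business name and ZIP (an assumed matching rule)."""
    name = record.get("BUSINESS NAME", "").lower()
    name = re.sub(r"[^a-z0-9]+", " ", name).strip()  # drop punctuation and case
    zip5 = record.get("ZIP", "").strip()[:5]          # ignore ZIP+4 suffixes
    return (name, zip5)

def merge_records(records):
    """Merge rows that share a key, filling empty fields from later duplicates."""
    merged = {}
    for rec in records:
        target = merged.setdefault(normalize_key(rec), {})
        for field, value in rec.items():
            # Keep the first non-empty value seen for each field.
            if value and not target.get(field):
                target[field] = value
    return list(merged.values())

def counts_by_state(records):
    """Count merged company records per state."""
    return Counter(rec.get("STATE", "") for rec in records)

# Hypothetical sample rows from two different sources.
sample = [
    {"BUSINESS NAME": "Acme Roofing, Inc.", "CITY": "Austin", "STATE": "TX",
     "ZIP": "78701", "Contact Name": ""},
    {"BUSINESS NAME": "ACME ROOFING INC", "CITY": "Austin", "STATE": "TX",
     "ZIP": "78701-1234", "Contact Name": "J. Doe", "Year Established": "1998"},
    {"BUSINESS NAME": "Birch Builders", "CITY": "Reno", "STATE": "NV", "ZIP": "89501"},
]

master = merge_records(sample)
# Sort by state, then print the per-state summary.
master.sort(key=lambda rec: rec.get("STATE", ""))
for state, count in sorted(counts_by_state(master).items()):
    print(state, count)
```

An exact-key match like this will miss variants such as abbreviations or misspellings, so it would only be a first pass before any fuzzy matching or manual review.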
Accuracy and attention to detail are key for this project. We are looking for someone with SQL or advanced Excel skills to complete this task using scripts or cleaning tools. Once as much automated merging and cleaning as possible has been completed, please review a sample of the records manually to ensure accuracy. We will check the final records for accuracy and to ensure that no companies have been deleted.
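As an illustration of the two checks described here, the sketch below draws a random sample of master records for manual review and lists any source company that has no counterpart in the master file. It assumes the same hypothetical matching rule of business name plus 5-digit ZIP; the function names and sample rows are made up for the example.

```python
import random

def company_key(record):
    """Assumed identity of a company: lowercased name plus 5-digit ZIP."""
    name = " ".join(record.get("BUSINESS NAME", "").lower().split())
    return (name, record.get("ZIP", "").strip()[:5])

def missing_companies(source_records, master_records):
    """Return source rows whose company does not appear in the master file."""
    master_keys = {company_key(rec) for rec in master_records}
    return [rec for rec in source_records if company_key(rec) not in master_keys]

def review_sample(master_records, size, seed=0):
    """Pick a reproducible random sample of master records to check by hand."""
    rng = random.Random(seed)
    return rng.sample(master_records, min(size, len(master_records)))

# Hypothetical rows: the master file has lost one company.
source_rows = [
    {"BUSINESS NAME": "Acme Roofing", "ZIP": "78701"},
    {"BUSINESS NAME": "Birch Builders", "ZIP": "89501"},
]
master_rows = [{"BUSINESS NAME": "acme  roofing", "ZIP": "78701-1234"}]

lost = missing_companies(source_rows, master_rows)
to_review = review_sample(master_rows, size=50)
```

A fixed seed keeps the review sample reproducible, so the same records can be rechecked after a correction.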
Looking forward to working with you. We will select a bidder early next week (May 6, 2013) and hope the project can be completed in a few days.
Additional Project Description:
05/02/2013 at 22:20 IST
Two additional notes:
:: Please provide the final files in Excel (.xls or .xlsx) format
:: I will only forward sample files to those shortlisted. I'll have a sample Excel file with all of the fields I'd like to see in the master record, but you might find more when working through the files
I am still collecting some records this week, so this project won't begin until next week.
05/04/2013 at 1:07 IST
We will have additional data to clean and merge with the master file in the future, so any suggestions that help us automate this task going forward would be welcome.
Thanks for bidding. I will award this project on Monday.