Find Jobs
Hire Freelancers

Clean a database file with >30 million records

$30-250 USD

In Progress
Posted about 9 years ago

$30-250 USD

Paid on delivery
We got a database with with > 30 million records stored in multiple CSV-files. Before being able to import these records into our CRM, we have to clean and consolidate the data. I'm looking for a trustworthy person who can help us to a) consolidate all data in one file b) eleminate all records without a value in one particular field IDNUMBER, c) clean 2-3 fields to numbers only without loosing leading zeros, d) find and eleminate all double entries based on field IDNUMBER (this field shall become the unique identifier) e) deliver the resulting database in MYSQL-format and CSV-format
Project ID: 7239697

About the project

5 proposals
Remote project
Active 9 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
5 freelancers are bidding on average $123 USD for this job
User Avatar
A proposal has not yet been provided
$45 USD in 3 days
4.8 (28 reviews)
4.8
4.8
User Avatar
Hi There, good day! I have done this cleaning job in the past, i am good at excel and data handling. Will create master file with all unique records in-them. Will remove all duplicates and align the data in proper order. Am good at excel, formulas. let em knwo if i can do this for you. Thank you Anupama.
$115 USD in 3 days
4.6 (31 reviews)
4.5
4.5
User Avatar
Dear Sir/Madam, My bid is on higher side as I know the pain in handling huge amount of data especailly when this is excel file. I have 20 years of IT experience and mostly in Database, Oracle, Sybase, Postgresql, SQL Server and MySQL. My brief background: I spent 13 years in USA as senior DBA mostly in US federal bodies like SEC, NASA, SBA, and FCC. I have great references from them. I also worked at Citibank singapore on behalf of Nucleus software. Here is the approach which I am going to take: 1. From each excel file I will upload the data into MySQL in all text field to contain all leading zeros without any primary key. 2. Will remove all the rows with blank primary key (IDNUMBER) 3. Will move all the distinct rows in a separate table or delete the duplicate rows if I find any search criteria for the duplicate. 4. Finally will create Primarykey on the filtered rows. If index created successfully then everything should be fine. 5. Export in a file and will send the mysql database backup(dump) Appreciate your attention. Warm regards, Pranesh Sinha
$250 USD in 4 days
5.0 (2 reviews)
2.9
2.9
User Avatar
A proposal has not yet been provided
$45 USD in 3 days
0.0 (1 review)
0.0
0.0
User Avatar
I've already worked with CSV files, that means import through a PHP Script into MySQL. I'm also pretty good with this DBMS. As Software Engineer this is a very easy task for me. Ready to start ASAP !!!
$161 USD in 3 days
0.0 (0 reviews)
0.0
0.0

About the client

Flag of UNITED KINGDOM
Manchester, United Kingdom
5.0
3
Payment method verified
Member since Sep 26, 2014

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.