Developing ETL pipeline using Azure, SSIS, Python ,SQL

I have to build an ETL pipeline of a data from a collaborating hospital data csv file.

Goal: Store the data in a cleaned and structured format into a database/file of choice. Write the code in Python or language of choice. Design a solution that can be scaled to TB of records.


1. Make assumptions and justify them where things are unclear with comments in the code.

2. Write unit tests for all your functions.

3. Write data tests to ensure that the data is correct.

4. Remove Protected health information (PHI): Names, Addresses etc.

5. Clean data. Remove invalid values. Normalize it where reasonable.

6. Add a column that calculates the average of all three glucose measurement time points.

7. Add a column based on the average of all three glucose measurement time points that indicates whether it’s normal, prediabetes or diabetes.

8. Store data in a database or file format of choice.

Skills: Python, Data Warehousing, Database Administration, ETL, MySQL

See more: books developing mobile applications using net35, ssis without sql 2000, developing prado apps using zend, developing online quiz using php mysql, using xcode ruby python php perl development, developing tabed menu using javascript, project developing online store using aspnet, tomcat version developing web services using jdk14 eclipse, etl project using sigma, developing data base using, using adwords api integrates sql, developing wap sites using aspnet, python etl pipeline, etl automation using python, azure devops python pipeline, python etl pipeline example, trigger azure data factory pipeline using rest api, modular image processing pipeline using opencv and python generators, etl pipeline python

About the Employer:
( 2 reviews ) DUBLIN, United States

Project ID: #29444371

7 freelancers are bidding on average $120 for this job


I can qualitatively design and develop required ETL using MS SQL Server because I am Senior MS SQL/BI Developer with more than 10 years of exceptional professional experience.

$130 USD in 3 days
(30 Reviews)

Hello i am expertise in sql queries and etl processing using ssis ping me if you are interested and give more information

$200 USD in 3 days
(9 Reviews)

Hi. I will suggest to use excel Power Query for data retrieval from files and the manipulation. Please chat for more detail. Johnny

$200 USD in 7 days
(1 Review)

Hi, I'm interested in Data Science. I worked SparkSql. I can deliver in 5 days. Working in coordination is my priority. I hope you contact me. Best Regards.

$170 USD in 5 days
(1 Review)

PYTHON JAVA PHP CSS HTML WOOCOMMERCE WORDPRESS CYBERSECURITY I'm a Linux Professional with over 5+ years of verifiable experience in the Web Hosting industry, I'm in the ideal position to offer a wide variety of Linux More

$20 USD in 7 days
(0 Reviews)

Hello, I am an experienced ETL /BI developer with around 5 years of experience working in data analysis and ETL development for retail and e-commerce clients. Delivered more than 50 dashboards and ETL solutions using S More

$20 USD in 4 days
(0 Reviews)

Hello, I am a Microsoft Certified Data Analyst and Business Intelligence Developer and Trainer with over 3 years experience building enterprise data warehouses, data analytics and business intelligence models, reports More

$100 USD in 7 days
(0 Reviews)