Optimized performance of lot of ETL pipelines to perform 10 times faster with low level code
Having 5 years experience on Hadoop big data ecosystems (Spark, HDFS, Hive, Oozie)
Experience on developing configurable ETLs in Scala using Spark
Running Kubernetes and Hadoop spark jobs in AWS EKS and Oozie respectively