With 10 years of overall experience, I specialize in designing, developing, and executing massive data pipelines, data lakes, and scalable ingestion systems on the Azure Cloud Platform.
My approach blends a deep understanding of Big Data technologies with modern cloud architecture. Whether it's Snowflake optimization, Apache Spark processing, or automated CI/CD data pipelines, I bridge the gap between raw data and actionable business intelligence.
I am a results-driven Data Engineer with 10 years of expertise and 5 focused years designing and implementing scalable data ingestion pipelines using Azure Data Factory. Over the years, I've successfully executed data lake requirements in numerous large companies using the Big Data Technology stack (Python, Spark, Hadoop, Hive).
I am proficient in leveraging Azure Databricks and Spark for distributed processing, and adept at designing cloud-based data warehouse solutions using Snowflake on Azure. I work collaboratively with stakeholders to implement logical and physical data models, ensuring performance, scalability, and data integrity.
Deep expertise in Multi-Cluster, Time Travel, cloning and performance optimization.
Strong track record optimizing Spark jobs and distributed processing pipelines.
Real-time data architecture using Kafka and Spark Streaming.
Automating robust data pipeline deployments in Azure DevOps.
Aug 2022 – Present
Oct 2020 – Jul 2022 | Dallas, TX
July 2019 – Sep 2020 | Hartford, CT
April 2018 – June 2019 | Chicago, IL
May 2015 – Mar 2018 | Rochester, MN
Nov 2013 – Apr 2015 | Chicago, IL
Oregon State University
Sep 2023 - Mar 2025 | CGPA: 3.86
Skills: Algorithms, Machine Learning, Database Management (DBMS), Data Science
Amrita Vishwa Vidyapeetham
Jul 2017 - Jun 2021 | CGPA: 7.68
Skills: Data Structures, Operating Systems, Algorithms, Big Data Analytics
Sasi Junior College, Velivennu
Jun 2015 - Jul 2017
St. Ann's E.M School, Rajahmundry
Jun 2002 - May 2015