Data Engineer, MS (DAE), BE

Hi, I am SATHVIK RAMAPPA

My Resume!!!

Who Am I ...?

I'm a Data Engineer with 2+ years of experience building the data infrastructure that teams actually rely on — pipelines that run reliably, schemas that hold under pressure, and systems that don't break when the data gets messy. I hold an MS in Data Analytics Engineering from Northeastern University and have worked across companies like Nike (via Merkle) and Northeastern University, engineering ETL/ELT pipelines at scale, designing data models in BigQuery and Snowflake, and owning data quality end to end. My toolkit includes Python, SQL, BigQuery, GCP, AWS, Snowflake, PostgreSQL, MongoDB, and Neo4j. I'm drawn to hard data problems where a single engineer can have outsized impact — building systems with clean ownership, documented schemas, and data contracts that actually mean something.

Proficient in ...

My core stack is Python and SQL, with deep experience in ETL/ELT pipeline design, data modeling, and schema architecture. I've built and maintained data warehouses on BigQuery and Snowflake, and worked across cloud platforms including GCP, AWS, and Microsoft Azure. On the database side, I'm hands-on with PostgreSQL, MongoDB, Neo4j, and MySQL — both relational and NoSQL. I've also worked with pipeline orchestration tools including Airflow, dbt, Spark, and Databricks, and I care deeply about data quality infrastructure: schema validation, freshness SLAs, lineage tracking, and data contracts.

My Work Experience

At Merkle Inc (Dentsu Global Services), I worked as a Data Engineer across two client engagements. At Nike, I engineered Python-based ETL pipelines to extract, transform, and load millions of user records from BigQuery into structured datasets, designed and optimized SQL data models on GCP, and automated end-to-end data transformation workflows — cutting manual processing time by 30%. At The Home Depot, I built Python ingestion and preprocessing pipelines to collect, clean, and structure high-volume text data, designed batch transformation pipelines to extract structured entity-level signals, and developed automated data feeds into reporting dashboards.

During my internship at OpenDataDSL, I engineered time series data pipelines to process high-frequency IoT sensor data from water treatment systems using Python and SQL, detecting anomalies and modeling operational patterns at scale. I deployed forecasting solutions on Microsoft Azure using cloud-native parallel computing, reducing model training and inference time by 50%, and integrated predictive maintenance capabilities into production workflows by connecting sensor outputs with models to improve system reliability.

Data Engineering Projects ...

Here are selected projects that showcase my work in data engineering, pipeline architecture, and database systems.

Urban Energy Grid Database System

Developed urban energy grid database at Northeastern, managing 10,000+ data points with MySQL and MongoDB. Created efficient structures and user-friendly website using Flask/Django for customer insights and admin monitoring. Utilized advanced querying for real-time analysis.

MongoDB MySQL ETL Flask & Django Python

Certifications that I've obtained ...

Here are few of the certifications that I have obtained

Data Visualization Dashboard

MongoDB Python Developer

Gained hands-on experience in data science and machine learning, covering methodology, tools, Python, SQL, data visualization, and analysis. Completed cloud-based labs, assignments, and a capstone project to apply their skills.

Python NoSQL MongoDB CLI MongoDB Compass MongoDB Atlas
Data Visualization Dashboard

IBM Data Science Professional Certificate

Gained hands-on experience in data science and machine learning, covering methodology, tools, Python, SQL, data visualization, and analysis. Completed cloud-based labs, assignments, and a capstone project to apply their skills.

Python SQL IBM Watson Data Science
Predictive Maintenance Model

Deep Learning Specialization

Completed specialization covering neural network architectures, optimization techniques, and practical applications with Python and TensorFlow. Relevant to understanding ML pipeline infrastructure and model serving patterns in production data systems.

CNN RNN LSTM PyTorch OpenCV
Inventory Optimization System

Machine Learning Specialization

Gained expertise in modern machine learning, covering supervised, unsupervised, and reinforcement learning, recommender systems, and best practices. Developed practical skills to apply ML techniques to real-world problems.

Machine Learning Scikit-learn TensorFlow Keras

Miscellaneous ...

In addition to technology, I enjoy watercolor painting. I love reading science fiction and mystery novels for exciting adventures and exploring storytelling through mangas. I’m also a fan of anime and appreciate its diverse narratives and styles. Gaming is another interest of mine, especially like Valorant and Assassin's Creed Valhalla.

Reach out to me ...

Here's how you can connect with me!!!

You can download my resume here