About Me
🛠️ Behind the Data & AI 🧑‍💻
Hello! I'm Bhavya, a passionate Data Engineer with 2+ years of experience designing, building, and operating secure, high-throughput data pipelines in AWS-based, regulated enterprise environments. I have a proven track record of owning production systems end-to-end and of optimizing CDC ingestion and distributed processing to scale data volumes while meeting strict accuracy, latency, and SLA requirements. I bring a strong background in SQL and Python, cloud-native data platforms, and SDLC-driven delivery.
🚀 Current: Data Engineer @ UPS (Contract) Jul 2024 - Present
Own and operate mission-critical payroll and financial reporting datasets supporting downstream analytics, audits, and compliance workflows with strict accuracy, timeliness, and SLA guarantees. Design, develop, and maintain scalable CDC-based ingestion pipelines on AWS using S3 as the system of record, Glue and EMR for distributed processing, and Redshift for analytical storage. Re-architected CDC ingestion workflows enabling reliable processing of 10× higher data volumes while remaining within SLA.
🔒 Privacy Data Engineer @ Ardent Privacy Jul 2023 - Jun 2024
Designed and operated compliance-centric data pipelines powering enterprise privacy operations (DSAR, consent, audit, retention) across healthcare, finance, and government clients. Built event-driven ingestion pipelines using Kafka and AWS-native messaging patterns with exactly-once semantics and end-to-end auditability. Embedded privacy controls including data classification, masking, tokenization, and pseudonymization aligned with GDPR and HIPAA requirements.
🎓 Software Developer & Graduate Assistant @ UMBC Sep 2022 - Jun 2023
Engineered and maintained production web systems for university-wide platforms supporting academic and administrative workflows. Led accessibility remediation initiatives to achieve WCAG 2.1 and WCAG 2.2 compliance, aligning with Section 508 and ADA requirements. Developed backend integrations and data-backed services using Python and SQL, supporting reporting and operational use cases.
💻 Full Stack Software Developer @ Virtuals Design Apr 2020 - Aug 2022
Designed and developed backend services for multi-tenant SaaS platforms supporting thousands of daily users and data-driven business workflows. Built and maintained Java-based backend components for transactional processing, data validation, and integration with downstream systems. Developed RESTful APIs using Java, Python, and SQL to support data ingestion, transformation, and reporting use cases.
⚡ Tech Expertise
I deliver production-ready data engineering solutions with expertise in AWS data platforms, distributed processing, and CDC-based ingestion pipelines. My focus is on building reliable, scalable data systems that support analytics, reporting, and compliance workloads.
🏆 Achievements & Certifications
- 🥇 IIT Bombay eYRC Finalist - Autonomous quadcopter rescue system
- 🏆 Smart India Hackathon National Finalist - Analytics dashboard for Adani Ports
- ☁️ Google Cloud Professional Data Engineer & Cloud Architect
- 🧱 Databricks Certified Professional Data Engineer
- ☁️ AWS Solutions Architect Professional
- ⚙️ Certified Kubernetes Administrator (CKA)
- 🔒 IAPP Certified Information Privacy Technologist (CIPT)
💬 Let's Connect!
I love connecting with engineers, students, and innovators! I share insights and provide advice on cloud architecture, data engineering, AI/ML, studying abroad, and career growth.
When I'm not coding or mentoring, you'll find me exploring new AI frameworks and staying at the cutting edge of responsible AI and MLOps.
Professional Certifications

Databricks Professional Data Engineer
CKA: Certified Kubernetes Administrator
IAPP CIPT: Certified Privacy Technologist
AWS Certified Solutions Architect Professional
Google Cloud Certified Professional Data Engineer
Google Cloud Certified Professional Cloud Architect
Technical Skills
Data Engineering
BigQuery & Data Platforms
Batch & Event Pipelines
Data Quality & Compliance
Analytics & BI
AI Data Systems & On-Call
Key Achievements
Delivering impactful solutions at scale
CDC Pipeline Architecture
Re-architected CDC ingestion workflows on AWS S3, Glue, EMR, and Redshift, enabling reliable processing of 10× higher data volumes while remaining within SLA
On-Call & Incident Response
Primary on-call engineer for data pipelines, performing deep root-cause analysis on production failures and implementing durable fixes to prevent recurrence
Privacy & Compliance Engineering
Designed compliance-centric pipelines powering DSAR, consent, audit, and retention operations aligned with HIPAA, GDPR, SOC 2, and PCI DSS
Query & Cost Optimization
Tuned Redshift workloads through distribution styles, sort keys, and incremental load strategies to improve query performance and stabilize reporting
Data Quality & Validation
Implemented automated data quality validations, freshness SLIs, and reconciliation checks at multiple pipeline stages to prevent silent data corruption
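For illustration only, and not the production implementation: a minimal Python sketch of what a freshness SLI and a row-count reconciliation check of this kind might look like. The `CheckResult` type, the function names, and the thresholds are all hypothetical, and the sketch assumes the load timestamp and row counts have already been fetched from the pipeline's metadata.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone


@dataclass
class CheckResult:
    """Outcome of one data quality check (hypothetical structure)."""
    name: str
    passed: bool
    detail: str


def check_freshness(last_loaded_at, max_lag, now=None):
    """Freshness SLI: pass if the most recent load is within the allowed lag."""
    now = now or datetime.now(timezone.utc)
    lag = now - last_loaded_at
    return CheckResult("freshness", lag <= max_lag, f"lag={lag}")


def check_reconciliation(source_rows, target_rows, tolerance=0.0):
    """Reconciliation: compare source and target row counts.

    `tolerance` is the allowed relative difference (0.0 means exact match).
    """
    if source_rows == 0:
        # Avoid dividing by zero: an empty source must produce an empty target.
        passed = target_rows == 0
    else:
        passed = abs(source_rows - target_rows) / source_rows <= tolerance
    detail = f"source={source_rows}, target={target_rows}"
    return CheckResult("reconciliation", passed, detail)
```

Running both checks after each pipeline stage and failing the run when any `passed` flag is false is one way to surface a stale or incomplete load before it reaches downstream reports.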
CI/CD & Infrastructure
Supported CI/CD-driven pipeline deployments using infrastructure-as-code and SDLC best practices across development, UAT, and production environments
Education
Master of Science in Information Systems
University of Maryland Baltimore County
Get In Touch
Let's collaborate on your next big project
Email
bhavyagada.dev@gmail.com
Phone
(571) 771-5507
Connect with me
GitHub
bhavyabgada
Topmate
Book a session
Kaggle
View competitions
Google Scholar
Research papers
Medium
Read my articles
X (Twitter)
Follow me
Ready to Build Something Amazing?
I'm always excited to discuss new opportunities, innovative projects, and ways to leverage data and AI to solve complex challenges.