I build scalable data infrastructure and quality frameworks that transform raw data into reliable analytics. With 3 years of experience on GCP, I've architected platforms that validate over 1.3 trillion records and enable data-driven decision-making at enterprise scale.
I specialize in designing and building production-grade data platforms that handle massive scale while maintaining reliability, performance, and data quality.
Records Validated
Data Assets Managed
Performance Improvement
Engineering Hours Saved Monthly
Building data infrastructure that powers analytics and drives business decisions
Together with my team, architected and built HSBC's ESG Data Quality framework from the ground up, establishing the technical foundation for enterprise-wide data governance and validation.
Built production-grade pipelines and infrastructure across client environments using modern data engineering practices.
A comprehensive toolkit for building scalable, reliable data platforms
Python, SQL, Scala, PySpark – production-grade data engineering
BigQuery, Dataproc, GCS, Composer, CloudSQL
Spark (3.x), Airflow, ETL/ELT, Delta Lake
Terraform, Jenkins, Maven, Git
Great Expectations, custom validations
Data modeling, warehouse design, batch and stream processing
Kafka, event-driven architecture
GDPR, security, observability
I'm always interested in discussing data engineering challenges and opportunities. Whether you're building new infrastructure or optimizing existing pipelines, let's connect.