Data Engineer | Azure Databricks | PySpark | SQL | Cloud Data Pipelines
About Me Data Engineer with 1+ year of hands-on experience based in the Netherlands, focused on building reliable cloud data pipelines on Azure Databricks. I came to data engineering from 8+ years as a software engineer — which means I think about data platforms the way I used to think about production software: reliability, ownership, and things that don't break silently.
🚀 Portfolio (Pinned Repository):
👉 Data Engineering & Analytics Portfolio
A curated collection of industry-ready, end-to-end data engineering and analytics projects.
What I'm working on Building end-to-end lakehouse pipelines for clients — ingesting from APIs, SQL databases, and vendor files into medallion architecture, with real attention to data quality and incremental processing.
Currently working toward the DP-700 Microsoft Fabric Data Engineer Associate certification.
Designing modern data platforms and end-to-end data pipelines with strong emphasis on scalability, data quality, and analytics consumption, primarily on Azure, while remaining cloud-agnostic.
Cloud & Data Platforms: Azure · Azure Data Factory · Azure Data Lake Storage (Gen2) · Azure Databricks
Data Engineering: ETL / ELT Pipelines · Medallion Architecture · SQL · Python · Data Modeling · Data Quality & Validation
Analytics & BI: Power BI · Tableau · Excel · KPI Reporting
Engineering & Collaboration: Git · GitHub · Version Control · Documentation
- Data Engineer
- Azure Data Engineer
- Analytics Engineer
- Cloud Data Engineer
- Data Platform Engineer
- Data Analytics Engineer
🔗 LinkedIn: https://www.linkedin.com/in/amee-joshi-09b77754/
💻 GitHub: https://github.com/AmeeJoshi-MCA