Mihályi Dávid

Data Engineer building reliable, usable data platforms.

Microsoft Fabric • Azure • PySpark • Delta Lake • Power BI • Data Quality (Great Expectations)

Projects I'm currently working on

Microsoft Fabric Sustainability Platform

Designed and scaled a sustainability data platform for a global manufacturer: ingestion to Lakehouse, transformations, Direct Lake semantic models, data-quality pipelines, and governed reporting in Power BI.

  • Great Expectations suites for row/table checks
  • Semantic models via API, RLS roles
  • Optimized CU consumption, shortcuts across workspaces
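
The row/table distinction in the first bullet can be illustrated with a minimal plain-Python sketch. It deliberately does not use the Great Expectations API. The column names (`site_id`, `emissions_kg`), the helper names, and the sample data are hypothetical and are not taken from the actual platform.

```python
from typing import Any

Row = dict[str, Any]


def check_rows(rows: list[Row]) -> list[int]:
    """Row-level check: return indexes of rows whose emissions value is missing or negative."""
    failed = []
    for i, row in enumerate(rows):
        value = row.get("emissions_kg")  # hypothetical column name
        if value is None or value < 0:
            failed.append(i)
    return failed


def check_table(rows: list[Row], min_rows: int = 1) -> dict[str, bool]:
    """Table-level checks: minimum row count and uniqueness of the key column."""
    ids = [row.get("site_id") for row in rows]  # hypothetical key column
    return {
        "row_count_ok": len(rows) >= min_rows,
        "site_id_unique": len(ids) == len(set(ids)),
    }


# Hypothetical sample batch with one negative value, one missing value,
# and a duplicated key.
sample = [
    {"site_id": "A", "emissions_kg": 12.5},
    {"site_id": "B", "emissions_kg": -1.0},
    {"site_id": "B", "emissions_kg": None},
]

row_failures = check_rows(sample)
table_results = check_table(sample)
```

Row-level checks point at individual bad records, while table-level checks describe the batch as a whole; a real suite would express both as declarative expectations rather than hand-written functions.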

Portfolio Playground

Explore live notebooks, interactive reports, and an experimental RAG assistant that mirror how I build, validate, and explain data products.

Streaming Quality Monitoring Notebook

A Fabric notebook that ingests event streams, flags quality regressions with Great Expectations, and surfaces fixes via Delta tables.

Executive Fabric Scorecard

Power BI Direct Lake dashboard tracking sustainability KPIs, CU consumption, and data quality SLAs.

The embedded Fabric Power BI report authenticates automatically for signed-in visitors.

Fabric Knowledge RAG Chat

Prototype assistant that answers stakeholder questions from curated Fabric playbooks and architecture notes.

Lead: How quickly can we onboard a new data domain?

RAG Assistant: The ingestion bootstrap takes ~3 days with our template pipelines. Day 1 provisioning, day 2 quality baselines, day 3 semantic model + dashboards.

Professional Experience

2025 Jan – Present · Accenture

Data Engineer

  • Developed Python scripts for data preparation and transformation, ensuring clean and usable data for business intelligence applications.
  • Conducted in-depth data analysis and developed business intelligence dashboards with Power BI, providing actionable insights for decision-making.
  • Contributed to architecture design for cloud solutions on Microsoft Azure.
  • Collaborated with cross-functional teams to design and deploy machine learning models using TensorFlow or Azure AI Studio, driving innovation in client projects.
  • Developed and maintained custom software solutions using Python.
  • Facilitated workshops and training sessions to upskill client teams in the latest technologies and best practices.
  • Wrote SQL scripts and stored procedures for data manipulation and extraction, optimizing database performance for BI use cases.
  • Created proof-of-concepts using Azure Data and AI solutions while staying current with the latest Azure services and technologies.
  • Organized and facilitated meetings, ensuring clear communication and alignment between all stakeholders.

2023 Mar – 2024 Dec · Ernst & Young

Technology Consultant

  • Developed Python scripts for data preparation and transformation, ensuring clean and usable data for business intelligence applications.
  • Conducted in-depth data analysis and developed business intelligence dashboards with Power BI, providing actionable insights for decision-making.
  • Contributed to architecture design for cloud solutions on Microsoft Azure.
  • Collaborated with cross-functional teams to design and deploy machine learning models using TensorFlow or Azure AI Studio, driving innovation in client projects.
  • Developed and maintained custom software solutions using Python.
  • Facilitated workshops and training sessions to upskill client teams in the latest technologies and best practices.
  • Wrote SQL scripts and stored procedures for data manipulation and extraction, optimizing database performance for BI use cases.
  • Created proof-of-concepts using Azure Data and AI solutions while staying current with the latest Azure services and technologies.
  • Organized and facilitated meetings, ensuring clear communication and alignment between all stakeholders.

Skills & Certifications

Cloud & Platform

Azure, Microsoft Fabric, Databricks, Synapse

Data Engineering

PySpark, SQL, Delta Lake, Data Modeling, CI/CD

Analytics

Power BI (Direct Lake), DAX, SemPy, Visualization

AI & ML

Machine Learning, Azure AI Studio, TensorFlow, MLOps practices, Responsible AI

Certifications

Azure Fundamentals AZ-900
Azure Data Fundamentals DP-900
Azure Data Engineer Associate DP-203
Azure AI Fundamentals AI-900
Azure AI Engineer Associate AI-102
Power BI Data Analyst Associate PL-300
Azure Administrator Associate AZ-104
Fabric Data Engineer Associate DP-700
Databricks Data Engineer Associate DB-DEA
Databricks Generative AI Associate DB-GEN-AI
Databricks Machine Learning Associate DB-ML

Book a Call

Free 30‑Minute Discovery

Pick a slot that works for you and we’ll dive into your Fabric, Databricks, or BI challenges. I’ll review your current setup beforehand so we can spend the call on solutions, not status updates.

  • Quick discovery of goals, blockers, and success metrics.
  • Follow-up summary with next-step recommendations.
Pick a time · Europe/Budapest

Contact

Open to data engineering and analytics projects. For inquiries, collaborations, or coffee:

Email · LinkedIn · GitHub