Skip to content
#

delta-live-tables

Here are 34 public repositories matching this topic...

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines

  • Updated Apr 24, 2026
  • Python

Azure End To End Data Engineering Project | Azure Data Factory | Azure Databricks | Azure SQL DB | PySpark | Big Data. It is a in depth Data Engineering project using powerful tools like Azure Data Factory, Azure SQL DB, Azure Databricks, Unity Catalog, Delta Live Tables, Spark Streaming, PySpark, Databricks Asset Bundles, GitHub, and more.

  • Updated Jan 12, 2026
  • Python

Real Estate ELT pipeline using Databricks Asset Bundles on GCP. Ingests, transforms, and analyzes property data via Delta Live Tables. Follows medallion architecture (Bronze/Silver/Gold), modular Python design, CI/CD automation with GitHub Actions, and full Unit and Integration tests coverage.

  • Updated Jul 22, 2025
  • Python

End-to-end Azure Data Engineering project using ADF for incremental ingestion, Databricks (DLT) for Medallion Architecture, and Delta Lake for CDC (SCD Type 1). Managed via Databricks Asset Bundles (DABs) for professional CI/CD. Focuses on real-time streaming, scalability, and Star Schema modeling.

  • Updated Jan 29, 2026
  • Python

Production-ready data pipeline using Azure Databricks Delta Live Tables with Medallion Architecture. Processes airline data via streaming (Event Hubs) and batch (ADLS Gen2) ingestion, enforces data quality constraints, and delivers analytics through Power BI dashboards. Demonstrates DLT, CDC, and Auto Loader.

  • Updated Nov 6, 2025
  • Jupyter Notebook

This project implements a modern data engineering pipeline using Databricks, PySpark, DBT, and Delta Live Tables. It follows the Medallion Architecture, supports realtime data ingestion with Autoloader, and models data with fact and dimension tables, including Slowly Changing Dimensions (SCD Type 2), all orchestrated in a scalable cloud environment

  • Updated Jul 15, 2025

Production-grade AWS + Databricks data quality platform. Multi-engine (Great Expectations + Soda + custom PySpark + DLT), DynamoDB results catalog with 3 GSIs, Unity Catalog lineage, SLO-driven burn-rate alerting (Slack + PagerDuty), Grafana dashboards, 15 Terraform modules.

  • Updated Apr 23, 2026
  • Python

Improve this page

Add a description, image, and links to the delta-live-tables topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the delta-live-tables topic, visit your repo's landing page and select "manage topics."

Learn more