bartosz25 / data-engineering-design-patterns-book
Code snippets for Data Engineering Design Patterns book
☆49Updated last week
Alternatives and similar repositories for data-engineering-design-patterns-book:
Users that are interested in data-engineering-design-patterns-book are comparing it to the libraries listed below
- Delta Lake Documentation☆48Updated 6 months ago
- A Python Library to support running data quality rules while the spark job is running⚡☆167Updated last week
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆62Updated 3 months ago
- Delta Lake examples☆214Updated 3 months ago
- Sample project to demonstrate data engineering best practices☆174Updated 10 months ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆192Updated 3 weeks ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆204Updated this week
- Delta Lake helper methods in PySpark☆312Updated 4 months ago
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆118Updated 6 months ago
- ☆72Updated 3 months ago
- Simple stream processing pipeline☆94Updated 7 months ago
- ☆107Updated 5 months ago
- Delta Lake helper methods. No Spark dependency.☆22Updated 4 months ago
- Code for dbt tutorial☆149Updated 7 months ago
- ☆98Updated 2 years ago
- The resources of the preparation course for Databricks Data Engineer Professional certification exam☆94Updated last month
- Template for Data Engineering and Data Pipeline projects☆106Updated 2 years ago
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆229Updated 2 months ago
- Quick Guides from Dremio on Several topics☆67Updated 2 months ago
- Code for my "Efficient Data Processing in SQL" book.☆54Updated 5 months ago
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆30Updated 10 months ago
- Code for "Efficient Data Processing in Spark" Course☆269Updated 3 months ago