marclamberti / training_materials
☆12Updated last year
Alternatives and similar repositories for training_materials
Users that are interested in training_materials are comparing it to the libraries listed below
- A repository of sample code to show data quality checking best practices using Airflow.☆77Updated 2 years ago
- Data lake, data warehouse on GCP☆56Updated 3 years ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 9 months ago
- ☆55Updated last week
- Full stack data engineering tools and infrastructure set-up☆53Updated 4 years ago
- Template for Data Engineering and Data Pipeline projects☆112Updated 2 years ago
- Code for "Advanced data transformations in SQL" free live workshop☆81Updated last month
- Public source code for the Udemy online course Apache Airflow: Complete Hands-On Beginner to Advanced Class.☆63Updated 4 years ago
- Execution of DBT models using Apache Airflow through Docker Compose☆117Updated 2 years ago
- Challenge Data Engineer☆25Updated 2 years ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆28Updated 2 years ago
- A simple Data Engineering solution for testing or education purposes. You only need to know SQL and Python to understand this project. Da…☆25Updated 2 years ago
- An example of an ETL pipeline that lays out generic DE processes. This is now out of date but still provides useful information☆26Updated 3 years ago
- This repository serves as a comprehensive guide to effective data modeling and robust data quality assurance using popular open-source to…☆30Updated last year
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆168Updated last year
- Code for dbt tutorial☆157Updated last year
- Data pipeline with dbt, Airflow, Great Expectations☆163Updated 3 years ago
- Demo DAGs that show how to run dbt Core in Airflow using Cosmos☆62Updated 3 weeks ago
- Delta-Lake, ETL, Spark, Airflow☆47Updated 2 years ago
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆35Updated last year
- Enforce Best Practices for all your Airflow DAGs. ⭐☆101Updated this week
- Cost Efficient Data Pipelines with DuckDB☆53Updated 3 weeks ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- ☆130Updated 10 months ago
- Code to demonstrate data engineering metadata & logging best practices☆16Updated last year
- Cloned by the `dbt init` task☆61Updated last year
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆220Updated last month
- This repo will guide you through a step-by-step method to create a star schema dimensional model.☆25Updated 4 years ago
- Sample Airflow DAGs to load data from the CovidTracking API to Snowflake via an AWS S3 intermediary.☆16Updated 4 years ago
- A Docker Compose template that builds an interactive development environment for PySpark with Jupyter Lab, MinIO as object storage, Hive M…☆46Updated 5 months ago