marclamberti / docker-airflow
Docker Airflow - Contains a docker compose file for Airflow 2.0
☆56 · Updated 2 years ago
Related projects:
- Resources for video demonstrations and blog posts related to DataOps on AWS ☆166 · Updated 2 years ago
- Execution of DBT models using Apache Airflow through Docker Compose ☆111 · Updated last year
- Apache Airflow in Docker Compose (for both versions 1.10.* and 2.*) ☆184 · Updated 9 months ago
- Code for dbt tutorial ☆138 · Updated 3 months ago
- A repository of sample code to show data quality checking best practices using Airflow. ☆71 · Updated last year
- Delta-Lake, ETL, Spark, Airflow ☆42 · Updated last year
- Project for "Data pipeline design patterns" blog. ☆41 · Updated last month
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment. ☆39 · Updated 3 years ago
- Simple stream processing pipeline ☆89 · Updated 3 months ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects ☆187 · Updated this week
- Docker with Airflow and Spark standalone cluster ☆239 · Updated last year
- Step-by-step tutorial on building a Kimball dimensional model with dbt ☆100 · Updated 2 months ago
- A repository of sample code to accompany our blog post on Airflow and dbt. ☆167 · Updated last year
- Spark data pipeline that processes movie ratings data. ☆26 · Updated last month
- The Python fake data producer for Apache Kafka® is a complete demo app allowing you to quickly produce JSON fake streaming datasets and … ☆81 · Updated 4 months ago
- (Project & tutorial) DAG pipeline tests + CI/CD setup ☆84 · Updated 3 years ago
- Materials for the next course ☆22 · Updated last year
- Data lake, data warehouse on GCP ☆54 · Updated 2 years ago
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow ☆31 · Updated 3 years ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python ☆43 · Updated last year
- Sample project to demonstrate data engineering best practices ☆156 · Updated 6 months ago
- Project files for the post: Running PySpark Applications on Amazon EMR using Apache Airflow: Using the new Amazon Managed Workflows for A… ☆41 · Updated 2 years ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow ☆52 · Updated 5 months ago
- Data pipeline with dbt, Airflow, Great Expectations ☆155 · Updated 3 years ago
- Delta Lake Documentation ☆45 · Updated 3 months ago
- End to end data engineering project ☆49 · Updated last year