marclamberti / airflow-materials-aws
Materials for the next course
☆24 Updated 2 years ago
Alternatives and similar repositories for airflow-materials-aws:
Users who are interested in airflow-materials-aws are comparing it to the repositories listed below.
- A repository of sample code to show data quality checking best practices using Airflow. ☆77 Updated 2 years ago
- Resources for video demonstrations and blog posts related to DataOps on AWS ☆175 Updated 3 years ago
- Public source code for the Udemy online course Apache Airflow: Complete Hands-On Beginner to Advanced Class. ☆63 Updated 4 years ago
- Airflow helm chart for AWS EKS ☆18 Updated 4 years ago
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment. ☆38 Updated 4 years ago
- Covid19 and Iowa Liquor Sales analysis at BigQuery using dbt, Airflow, Marquez, Google Cloud and other modern data stack tools ☆14 Updated 2 years ago
- Project files for the post: Running PySpark Applications on Amazon EMR using Apache Airflow: Using the new Amazon Managed Workflows for A… ☆41 Updated 2 years ago
- ☆21 Updated 3 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow. ☆168 Updated last year
- Pyspark boilerplate for running prod ready data pipeline ☆28 Updated 4 years ago
- This repo contains live examples to build Databricks' Lakehouse and recommended best practices from the field. ☆19 Updated 6 months ago
- Execution of DBT models using Apache Airflow through Docker Compose ☆116 Updated 2 years ago
- Spark data pipeline that processes movie ratings data. ☆28 Updated last month
- Data engineering with dbt, published by Packt ☆77 Updated last year
- Sample Airflow DAGs ☆62 Updated 2 years ago
- A repository of sample code to accompany our blog post on Airflow and dbt. ☆172 Updated last year
- Data lake, data warehouse on GCP ☆56 Updated 3 years ago
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database ☆75 Updated 3 years ago
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow ☆33 Updated 4 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio… ☆54 Updated 2 years ago
- ☆38 Updated 4 years ago
- Docker Airflow - Contains a docker compose file for Airflow 2.0 ☆65 Updated 2 years ago
- Cloned by the `dbt init` task ☆61 Updated last year
- Airflow training for the crunch conf ☆105 Updated 6 years ago
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats ☆29 Updated 2 years ago
- Amazon Managed Workflows for Apache Airflow (MWAA) Examples repository contains example DAGs, requirements.txt, plugins, and CloudFormati… ☆115 Updated 5 months ago
- ☆55 Updated 3 months ago
- CICD pipeline that deploys a dbt image on a GKE cluster ☆11 Updated 3 years ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python ☆43 Updated 2 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup ☆87 Updated 4 years ago