ananthdurai / airflow-training
Airflow training for the crunch conf
☆105Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for airflow-training
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆166Updated last year
- (project & tutorial) dag pipeline tests + ci/cd setup☆85Updated 3 years ago
- A repository of sample code to show data quality checking best practices using Airflow.☆72Updated last year
- Public source code for the Udemy online course Apache Airflow: Complete Hands-On Beginner to Advanced Class.☆62Updated 4 years ago
- ☆196Updated last year
- How to build an awesome data engineering team☆99Updated 5 years ago
- Repository used for Spark Trainings☆53Updated last year
- ☆83Updated last year
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆53Updated last year
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…☆89Updated 2 years ago
- ☆25Updated last year
- Airflow Unit Tests and Integration Tests☆256Updated 2 years ago
- Execution of DBT models using Apache Airflow through Docker Compose☆113Updated last year
- Great Expectations Airflow operator☆159Updated 3 weeks ago
- Data engineering interviews Q&A for data community by data community☆61Updated 4 years ago
- ETL pipeline using pyspark (Spark - Python)☆108Updated 4 years ago
- Example DAGs using hooks and operators from Airflow Plugins☆333Updated 6 years ago
- Data validation library for PySpark 3.0.0☆34Updated 2 years ago
- Use Airflow to move data from multiple MySQL databases to BigQuery☆99Updated 4 years ago
- Demonstration of using Apache Spark to build robust ETL pipelines while taking advantage of open source, general purpose cluster computin…☆24Updated last year
- A repository of sample code to accompany our blog post on Airflow and dbt.☆167Updated last year
- This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging…☆73Updated 5 years ago
- 🐍💨 Airflow tutorial for PyCon 2019☆85Updated last year
- Guide for databricks spark certification☆58Updated 3 years ago
- Data lake, data warehouse on GCP☆54Updated 2 years ago
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆111Updated 4 months ago
- Data pipeline with dbt, Airflow, Great Expectations☆158Updated 3 years ago
- A Python API for Asynchronously Loading Data into Snowflake DB -☆60Updated this week
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆36Updated 4 months ago
- Simple stream processing pipeline☆92Updated 5 months ago