godatadriven / data-pipelines-with-airflow-2nd-edLinks
Code for the second edition of Data Pipelines with Apache Airflow Book
☆23Updated last week
Alternatives and similar repositories for data-pipelines-with-airflow-2nd-ed
Users that are interested in data-pipelines-with-airflow-2nd-ed are comparing it to the libraries listed below
Sorting:
- A Series of Notebooks on how to start with Kafka and Python☆152Updated 9 months ago
- ☆48Updated 4 years ago
- ☆190Updated 4 years ago
- ☆93Updated 2 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆89Updated 4 years ago
- Django-based course management platform for Zoomcamps☆76Updated this week
- Data engineering with dbt, published by Packt☆88Updated 3 months ago
- Code snippets for Data Engineering Design Patterns book☆294Updated 8 months ago
- Code repository for the "PySpark in Action" book☆212Updated 6 months ago
- Full stack data engineering tools and infrastructure set-up☆57Updated 4 years ago
- ☆23Updated 4 years ago
- Code for dbt tutorial☆165Updated 3 months ago
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…☆89Updated 4 years ago
- Docker Airflow - Contains a docker compose file for Airflow 2.0☆69Updated 3 years ago
- Data Engineering with Spark and Delta Lake☆106Updated 2 years ago
- Execution of DBT models using Apache Airflow through Docker Compose☆126Updated 2 years ago
- A repository of sample code to show data quality checking best practices using Airflow.☆78Updated 2 years ago
- Sample Airflow DAGs to load data from the CovidTracking API to Snowflake via an AWS S3 intermediary.☆16Updated 4 years ago
- This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging…☆95Updated 6 years ago
- Project for "Data pipeline design patterns" blog.☆47Updated last year
- Mastering Big Data Analytics with PySpark, Published by Packt☆163Updated last year
- PySpark functions and utilities with examples. Assists ETL process of data modeling☆104Updated 5 years ago
- ☆10Updated 3 years ago
- Sample project to demonstrate data engineering best practices☆202Updated last year
- ☆88Updated 3 years ago
- Snowflake Cookbook, published by Packt☆82Updated 2 years ago
- PySpark Tutorial for Beginners - Practical Examples in Jupyter Notebook with Spark version 3.4.1. The tutorial covers various topics like…☆139Updated 2 years ago
- Companion repository to the ETL & ELT Pipelines with Apache Airflow® eBook☆36Updated last month
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra☆143Updated 2 years ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆42Updated 2 years ago