anilkulkarni87 / airflow-docker
This is my Apache Airflow Local development setup on Windows 10 WSL2/Mac using docker-compose. It will also include some sample DAGs and workflows.
☆31Updated last year
Alternatives and similar repositories for airflow-docker
Users that are interested in airflow-docker are comparing it to the libraries listed below
- Delta-Lake, ETL, Spark, Airflow☆47Updated 2 years ago
- Project for real-time anomaly detection using Kafka and python☆58Updated 2 years ago
- ☆12Updated 3 years ago
- The Data Pipeline and Analytics Stack is a comprehensive solution designed for processing, storing, and visualizing data. Explore a compl…☆16Updated last year
- A Series of Notebooks on how to start with Kafka and Python☆153Updated 4 months ago
- Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary.☆29Updated last year
- Kafka variant of the MLOps Level 1 stack☆25Updated 3 years ago
- Docker Airflow - Contains a docker compose file for Airflow 2.0☆68Updated 2 years ago
- Fully reproducible, Dockerized, step-by-step, tutorial on how to mock a "real-time" Kafka data stream from a timestamped csv file. Detai…☆40Updated 3 years ago
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…☆90Updated 3 years ago
- Simple alert system implemented in Kafka and Python☆96Updated 7 years ago
- Superset Quick Start Guide, published by Packt☆56Updated last year
- A repository of sample code to show data quality checking best practices using Airflow.☆77Updated 2 years ago
- A Postgres data warehouse for processing synthetic data using IAC principles☆18Updated 2 years ago
- ☆12Updated 3 years ago
- Apache Airflow advanced functionalities examples☆19Updated last year
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines.☆47Updated last year
- Analytics engineering with dbt - projects and developer environment☆19Updated 9 months ago
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆47Updated last year
- Content related to Mastering Postgresql along with videos.☆16Updated 3 years ago
- ☆37Updated 5 years ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆28Updated 3 years ago
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆37Updated last year
- Deploying a Machine Learning model streaming application with Apache Kafka☆10Updated 2 years ago
- Streamlit example showing Scikit Learn & Pyspark ML over Healthcare data! It's simple!☆32Updated 4 years ago
- ☆41Updated last year
- Public source code for the Batch Processing with Apache Beam (Python) online course☆18Updated 4 years ago
- Full stack data engineering tools and infrastructure set-up☆53Updated 4 years ago
- This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which…☆100Updated 11 months ago
- Creating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆14Updated 2 years ago