anilkulkarni87 / airflow-docker
This is my Apache Airflow local development setup on Windows 10 WSL2/Mac using docker-compose. It will also include some sample DAGs and workflows.
☆34 Updated last year
Alternatives and similar repositories for airflow-docker
Users who are interested in airflow-docker are comparing it to the repositories listed below
- ☆45 Updated last year
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a… ☆44 Updated last year
- Project for real-time anomaly detection using Kafka and python ☆59 Updated 3 years ago
- The Data Pipeline and Analytics Stack is a comprehensive solution designed for processing, storing, and visualizing data. Explore a compl… ☆17 Updated 2 years ago
- Delta-Lake, ETL, Spark, Airflow ☆48 Updated 3 years ago
- ☆12 Updated 3 years ago
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar… ☆47 Updated 2 years ago
- Content related to Mastering Postgresql along with videos. ☆18 Updated 4 years ago
- Docker Airflow - Contains a docker compose file for Airflow 2.0 ☆70 Updated 3 years ago
- Data pipeline that scrapes Rust cheater Steam profiles ☆54 Updated 3 years ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB. ☆43 Updated 2 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, duckdb and Superset ☆46 Updated last month
- A Series of Notebooks on how to start with Kafka and Python ☆151 Updated 11 months ago
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO ☆65 Updated 2 years ago
- Spark, Airflow, Kafka ☆24 Updated 2 years ago
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks ☆24 Updated 3 years ago
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data ☆49 Updated 2 years ago
- ☆32 Updated 2 years ago
- Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary. ☆29 Updated 2 years ago
- A Postgres data warehouse for processing synthetic data using IAC principles ☆19 Updated 2 years ago
- build dw with dbt ☆50 Updated last year
- Airflow Tutorials ☆25 Updated 4 years ago
- Data Engineering examples for Airflow, Prefect; dbt for BigQuery, Redshift, ClickHouse, Postgres, DuckDB; PySpark for Batch processing; K… ☆69 Updated this week
- Stream/batch system with Hadoop, Spark on NYC taxi data | #DE ☆26 Updated 4 months ago
- End-to-end data platform leveraging the Modern data stack ☆52 Updated last year
- Building a Data Pipeline with an Open Source Stack ☆55 Updated 7 months ago
- A data pipeline moving data from a Relational database system (RDBMS) to a Hadoop file system (HDFS). ☆15 Updated 4 years ago
- Kafka variant of the MLOps Level 1 stack ☆27 Updated 3 years ago
- A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for … ☆141 Updated 5 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio… ☆56 Updated 2 years ago