data-burst / airflow-git-sync
Sync DAG changes from Git to Airflow
☆55Updated 6 months ago
Alternatives and similar repositories for airflow-git-sync:
Users that are interested in airflow-git-sync are comparing it to the libraries listed below
- Dockerized monitoring stack for Apache Airflow☆26Updated 6 months ago
- Ecommerce Realtime Data Pipeline (Data Modeling, Workflow Orchestration, Change Data Capture, Analytical Database and Dashboarding)☆54Updated last year
- Set up your Ubuntu system with essential and fun packages☆21Updated 9 months ago
- The Python fake data producer for Apache Kafka® is a complete demo app allowing you to quickly produce JSON fake streaming datasets and …☆83Updated 10 months ago
- ☆320Updated 2 months ago
- dbt + Trino demo project, using TPC-H sample data☆19Updated 11 months ago
- Python rate limit☆33Updated 2 years ago
- Docker Airflow - Contains a docker compose file for Airflow 2.0☆65Updated 2 years ago
- Sparglim ✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier!☆36Updated last week
- Mockafka-py is a Python library designed for in-memory mocking of Kafka.[aiokafka - confluence-kafka-python]☆48Updated last week
- ☆49Updated last week
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)☆52Updated last year
- A Grafana datasource plugin for Apache Kafka☆43Updated last year
- This repository hosts materials for the Docker for Data Engineers workshop, offering hands-on exercises and resources tailored for data e…☆16Updated 9 months ago
- Repository of helm charts for deploying DataHub on a Kubernetes cluster☆175Updated last week
- Code snippets for Data Engineering Design Patterns book☆74Updated last month
- Presto Trino with Apache Hive Postgres metastore☆40Updated 6 months ago
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆72Updated 3 years ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆212Updated this week
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆32Updated last year
- ☆15Updated last year
- Complete data engineering pipeline running on Minikube Kubernetes, Argo CD, Spark, Trino, S3, Delta lake, Postgres+ Debezium CDC, MySQL,…☆28Updated last month
- Operator for Apache Spark-on-Kubernetes for Stackable Data Platform☆61Updated this week
- Build Data Lake using Open Source tools☆91Updated 4 months ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Superset☆39Updated 4 months ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆64Updated 5 months ago
- Utility functions for dbt projects running on Trino☆21Updated last year
- A modern python scheduling framework with dependency injection and modular integration support. Alternative for Rocketry or apscheduler☆180Updated 5 months ago
- New generation opensource data stack☆65Updated 2 years ago