apache / airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
☆40,052 Updated this week
Alternatives and similar repositories for airflow
Users who are interested in airflow compare it to the repositories listed below.
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis… ☆18,268 Updated 3 weeks ago
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python. ☆19,290 Updated this week
- Docker Apache Airflow ☆3,801 Updated 2 years ago
- An orchestration platform for the development, production, and observation of data assets. ☆13,121 Updated this week
- Apache Superset is a Data Visualization and Data Exploration Platform ☆66,255 Updated this week
- The official home of the Presto distributed SQL query engine for big data ☆16,329 Updated this week
- Machine Learning Toolkit for Kubernetes ☆14,966 Updated last month
- Always know what to expect from your data. ☆10,386 Updated this week
- Apache Spark - A unified analytics engine for large-scale data processing ☆41,110 Updated this week
- Optional static typing for Python ☆19,283 Updated this week
- Parallel computing with task scheduling ☆13,204 Updated this week
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics ☆15,415 Updated this week
- A time-series database for high-performance real-time analytics packaged as a Postgres extension ☆19,104 Updated this week
- Build, Manage and Deploy AI/ML Systems ☆8,807 Updated this week
- A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin ☆6,363 Updated 2 months ago
- Python packaging and dependency management made easy ☆33,164 Updated this week
- Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more. ☆6,484 Updated last week
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data. ☆27,315 Updated this week
- The Moby Project - a collaborative project for the container ecosystem to assemble container-based systems ☆69,740 Updated this week
- Apache Flink ☆24,848 Updated this week
- Daemon for easy but powerful stats aggregation ☆17,816 Updated 3 months ago
- Mirror of Apache Kafka ☆30,075 Updated this week
- Fluentd: Unified Logging Layer (project under CNCF) ☆13,142 Updated this week
- Apache NiFi ☆5,319 Updated this week
- The uncompromising Python code formatter ☆40,234 Updated this week
- Streamlit - A faster way to build and share data apps. ☆39,330 Updated this week
- TiDB - the open-source, cloud-native, distributed SQL database designed for modern applications. ☆38,427 Updated this week
- MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license. ☆52,393 Updated last week
- The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data ☆41,982 Updated this week
- A curated list of data engineering tools for software developers ☆7,364 Updated 3 weeks ago