Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
☆44,430Updated this week
Alternatives and similar repositories for airflow
Users that are interested in airflow are comparing it to the libraries listed below
Sorting:
- Apache Superset is a Data Visualization and Data Exploration Platform☆70,661Updated this week
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis…☆18,681Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.☆21,652Updated this week
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.☆28,236Feb 20, 2026Updated last week
- Apache Spark - A unified analytics engine for large-scale data processing☆42,898Updated this week
- An orchestration platform for the development, production, and observation of data assets.☆15,007Updated this week
- The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data☆46,088Updated this week
- ClickHouse® is a real-time analytics database management system☆46,035Updated this week
- Distributed Task Queue (development branch)☆28,152Updated this week
- The Prometheus monitoring system and time series database.☆62,945Updated this week
- FastAPI framework, high performance, easy to learn, fast to code, ready for production☆95,554Updated this week
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application…☆12,279Updated this week
- The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lak…☆20,749Updated this week
- The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, …☆24,365Updated this week
- Parallel computing with task scheduling☆13,746Updated this week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.☆41,413Updated this week
- The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Pro…☆72,401Updated this week
- 🦍 The API and AI Gateway☆42,818Jan 19, 2026Updated last month
- Streamlit — A faster way to build and share data apps.☆43,634Updated this week
- MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.☆60,395Feb 12, 2026Updated 2 weeks ago
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics☆16,529Updated this week
- Apache Druid: a high performance real-time analytics database.☆13,942Updated this week
- Machine Learning Toolkit for Kubernetes☆15,462Jan 5, 2026Updated last month
- Workflow Engine for Kubernetes☆16,464Feb 20, 2026Updated last week
- The official home of the Presto distributed SQL query engine for big data☆16,662Updated this week
- Python packaging and dependency management made easy☆34,279Updated this week
- Production-Grade Container Scheduling and Management☆120,774Updated this week
- Connect, secure, control, and observe services.☆38,042Updated this week
- Docker Apache Airflow☆3,809Mar 1, 2023Updated 2 years ago
- DuckDB is an analytical in-process SQL database management system☆36,346Updated this week
- Terraform enables you to safely and predictably create, change, and improve infrastructure. It is a source-available tool that codifies A…☆47,807Updated this week
- Data validation using Python type hints☆26,977Updated this week
- Extremely fast Query Engine for DataFrames, written in Rust☆37,513Updated this week
- Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate eve…☆68,143Updated this week
- Always know what to expect from your data.☆11,162Feb 20, 2026Updated last week
- The Cloud Native Application Proxy☆61,837Feb 20, 2026Updated last week
- Apache Beam is a unified programming model for Batch and Streaming data processing.☆8,492Updated this week
- The Metadata Platform for your Data and AI Stack☆11,608Updated this week
- TiDB - the open-source, cloud-native, distributed SQL database designed for modern applications.☆39,857Updated this week