apache / airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
☆38,618Updated this week
Alternatives and similar repositories for airflow:
Users that are interested in airflow are comparing it to the libraries listed below
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis…☆18,072Updated last week
- Docker Apache Airflow☆3,792Updated last year
- An orchestration platform for the development, production, and observation of data assets.☆12,470Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.☆18,202Updated this week
- Apache Spark - A unified analytics engine for large-scale data processing☆40,477Updated this week
- ClickHouse® is a real-time analytics database management system☆38,865Updated this week
- Apache Druid: a high performance real-time analytics database.☆13,602Updated this week
- The Prometheus monitoring system and time series database.☆57,098Updated this week
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application…☆10,334Updated this week
- Apache Superset is a Data Visualization and Data Exploration Platform☆64,295Updated this week
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.☆26,856Updated this week
- The official home of the Presto distributed SQL query engine for big data☆16,192Updated this week
- Apache NiFi☆5,068Updated this week
- Connect, secure, control, and observe services.☆36,442Updated this week
- The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data☆40,482Updated this week
- Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.☆6,454Updated 3 weeks ago
- MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.☆49,942Updated this week
- Open source platform for the machine learning lifecycle☆19,417Updated this week
- Cloud-native high-performance edge/middle/service proxy☆25,422Updated this week
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆7,794Updated this week
- Apache Iceberg☆6,858Updated this week
- The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Pro…☆66,339Updated this week
- 🦍 The Cloud-Native API Gateway and AI Gateway.☆39,927Updated this week
- Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.☆5,985Updated this week
- Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)☆10,794Updated this week
- Data-Centric Pipelines and Data Versioning☆6,201Updated this week
- Azkaban workflow manager.☆4,485Updated 7 months ago
- Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.☆11,027Updated this week
- Distributed Task Queue (development branch)☆25,465Updated this week
- Scalable datastore for metrics, events, and real-time analytics☆29,432Updated this week