apache / airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
☆38,298Updated this week
Alternatives and similar repositories for airflow:
Users that are interested in airflow are comparing it to the libraries listed below
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis…☆18,024Updated this week
- The Prometheus monitoring system and time series database.☆56,847Updated this week
- An orchestration platform for the development, production, and observation of data assets.☆12,300Updated this week
- The official home of the Presto distributed SQL query engine for big data☆16,155Updated this week
- Apache Superset is a Data Visualization and Data Exploration Platform☆63,827Updated this week
- Apache Spark - A unified analytics engine for large-scale data processing☆40,326Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.☆18,036Updated this week
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics☆14,846Updated this week
- An Open Source Machine Learning Framework for Everyone☆187,272Updated this week
- The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Pro…☆65,917Updated this week
- Apache Druid: a high performance real-time analytics database.☆13,593Updated this week
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.☆26,754Updated this week
- Docker Apache Airflow☆3,790Updated last year
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆7,755Updated this week
- Always know what to expect from your data.☆10,117Updated this week
- ClickHouse® is a real-time analytics database management system☆38,472Updated this week
- A library that provides an embeddable, persistent key-value store for fast storage.☆28,975Updated this week
- Open Source AI/ML Platform☆8,446Updated this week
- MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.☆49,484Updated this week
- Mirror of Apache Kafka☆29,248Updated this week
- Like Prometheus, but for logs.☆24,373Updated this week
- Parallel computing with task scheduling☆12,851Updated this week
- TiDB - the open-source, cloud-native, distributed SQL database designed for modern applications.☆37,734Updated this week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.☆34,903Updated this week
- Machine Learning Toolkit for Kubernetes☆14,544Updated last month
- high-performance graph database for real-time use cases☆20,582Updated this week
- Distributed reliable key-value store for the most critical data of a distributed system☆48,212Updated this week
- A time-series database for high-performance real-time analytics packaged as a Postgres extension☆18,199Updated this week
- Logstash - transport and process your logs, events, or other data☆14,316Updated this week
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application…☆10,231Updated this week