apache / airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
☆41,186 · Updated this week
Alternatives and similar repositories for airflow
Users who are interested in airflow compare it to the libraries listed below.
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data. ☆27,555 · Updated this week
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis… ☆18,378 · Updated 2 months ago
- Apache Superset is a Data Visualization and Data Exploration Platform ☆67,245 · Updated this week
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application… ☆11,139 · Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python. ☆19,825 · Updated this week
- Apache Beam is a unified programming model for Batch and Streaming data processing. ☆8,223 · Updated this week
- Always know what to expect from your data. ☆10,586 · Updated this week
- The Metadata Platform for your Data and AI Stack ☆10,853 · Updated this week
- The official home of the Presto distributed SQL query engine for big data ☆16,409 · Updated last week
- ClickHouse® is a real-time analytics database management system ☆41,902 · Updated this week
- Apache Druid: a high performance real-time analytics database. ☆13,771 · Updated this week
- Apache Spark - A unified analytics engine for large-scale data processing ☆41,496 · Updated this week
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics ☆15,727 · Updated this week
- The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lak… ☆18,937 · Updated this week
- Parallel computing with task scheduling ☆13,360 · Updated this week
- Data-Centric Pipelines and Data Versioning ☆6,243 · Updated 5 months ago
- An orchestration platform for the development, production, and observation of data assets. ☆13,636 · Updated this week
- The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data ☆42,922 · Updated this week
- A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin ☆6,411 · Updated 4 months ago
- Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ. ☆11,643 · Updated this week
- Mirror of Apache Kafka ☆30,517 · Updated this week
- Distributed Task Queue (development branch) ☆26,851 · Updated last week
- MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license. ☆53,904 · Updated this week
- The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Pro… ☆69,097 · Updated this week
- JuiceFS is a distributed POSIX file system built on top of Redis and S3. ☆11,943 · Updated this week
- The Prometheus monitoring system and time series database. ☆59,555 · Updated last week
- 🦍 The Cloud-Native API Gateway and AI Gateway. ☆41,363 · Updated this week
- Machine Learning Toolkit for Kubernetes ☆15,093 · Updated this week
- Build, Manage and Deploy AI/ML Systems ☆9,254 · Updated this week
- Alluxio, data orchestration for analytics and machine learning in the cloud ☆7,035 · Updated 2 months ago