apache / airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
☆43,668 Updated this week
Alternatives and similar repositories for airflow
Users who are interested in airflow are comparing it to the repositories listed below.
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis… ☆18,611 Updated 7 months ago
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application… ☆12,039 Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python. ☆21,196 Updated this week
- Apache Beam is a unified programming model for Batch and Streaming data processing. ☆8,425 Updated this week
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data. ☆28,118 Updated 2 weeks ago
- Apache Superset is a Data Visualization and Data Exploration Platform ☆69,598 Updated this week
- An orchestration platform for the development, production, and observation of data assets. ☆14,676 Updated this week
- Mirror of Apache Kafka ☆31,600 Updated this week
- The official home of the Presto distributed SQL query engine for big data ☆16,604 Updated last week
- Always know what to expect from your data. ☆11,039 Updated last week
- Apache Druid: a high performance real-time analytics database. ☆13,908 Updated this week
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics ☆16,318 Updated this week
- Parallel computing with task scheduling ☆13,679 Updated this week
- Apache Spark - A unified analytics engine for large-scale data processing ☆42,553 Updated this week
- Scalable datastore for metrics, events, and real-time analytics ☆31,035 Updated last week
- Workflow Engine for Kubernetes ☆16,319 Updated last week
- Machine Learning Toolkit for Kubernetes ☆15,378 Updated last week
- The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data ☆45,430 Updated this week
- Write scalable load tests in plain Python 🚗💨 ☆27,286 Updated this week
- TiDB - the open-source, cloud-native, distributed SQL database designed for modern applications. ☆39,524 Updated this week
- The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Pro… ☆71,472 Updated this week
- NoSQL data store using the Seastar framework, compatible with Apache Cassandra and Amazon DynamoDB ☆15,182 Updated this week
- CNCF Jaeger, a Distributed Tracing Platform ☆22,279 Updated this week
- Distributed Task Queue (development branch) ☆27,788 Updated last week
- A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin ☆6,497 Updated 2 months ago
- ClickHouse® is a real-time analytics database management system ☆44,962 Updated this week
- Cloud-native high-performance edge/middle/service proxy ☆27,248 Updated this week
- A library that provides an embeddable, persistent key-value store for fast storage. ☆31,276 Updated last week
- The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lak… ☆20,366 Updated this week
- CockroachDB - the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placemen… ☆31,654 Updated this week