apache / airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
☆40,585 · Updated this week
Alternatives and similar repositories for airflow
Users who are interested in airflow are comparing it to the libraries listed below.
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis… ☆18,343 · Updated last month
- Apache Superset is a Data Visualization and Data Exploration Platform ☆66,704 · Updated this week
- Docker Apache Airflow ☆3,806 · Updated 2 years ago
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics ☆15,579 · Updated this week
- Parallel computing with task scheduling ☆13,277 · Updated this week
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data. ☆27,429 · Updated 2 weeks ago
- Streamlit — A faster way to build and share data apps. ☆39,907 · Updated this week
- An orchestration platform for the development, production, and observation of data assets. ☆13,408 · Updated this week
- The official home of the Presto distributed SQL query engine for big data ☆16,378 · Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python. ☆19,550 · Updated this week
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application… ☆10,990 · Updated this week
- Apache Beam is a unified programming model for Batch and Streaming data processing. ☆8,159 · Updated this week
- ClickHouse® is a real-time analytics database management system ☆41,284 · Updated this week
- Distributed Task Queue (development branch) ☆26,612 · Updated this week
- Apache Spark - A unified analytics engine for large-scale data processing ☆41,321 · Updated this week
- Apache Druid: a high performance real-time analytics database. ☆13,744 · Updated this week
- The Prometheus monitoring system and time series database. ☆58,982 · Updated this week
- The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Pro… ☆68,484 · Updated this week
- Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object. ☆27,707 · Updated 2 weeks ago
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. ☆37,558 · Updated this week
- Python packaging and dependency management made easy ☆33,338 · Updated this week
- Always know what to expect from your data. ☆10,480 · Updated this week
- Python client for Apache Kafka ☆5,759 · Updated this week
- MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license. ☆53,171 · Updated last week
- Static Type Checker for Python ☆14,447 · Updated this week
- The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data ☆42,351 · Updated this week
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr… ☆8,088 · Updated this week
- 🦉 Data Versioning and ML Experiments ☆14,563 · Updated this week
- Scalable datastore for metrics, events, and real-time analytics ☆30,173 · Updated this week
- Apache Iceberg ☆7,584 · Updated this week