apache / airflowLinks
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
☆40,193Updated this week
Alternatives and similar repositories for airflow
Users that are interested in airflow are comparing it to the libraries listed below
Sorting:
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis…☆18,304Updated 2 weeks ago
- Machine Learning Toolkit for Kubernetes☆14,998Updated last month
- Apache Superset is a Data Visualization and Data Exploration Platform☆66,416Updated this week
- Distributed Task Queue (development branch)☆26,420Updated last week
- Scrapy, a fast high-level web crawling & scraping framework for Python.☆55,337Updated last week
- Workflow Engine for Kubernetes☆15,655Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.☆19,356Updated this week
- Apache Beam is a unified programming model for Batch and Streaming data processing.☆8,120Updated this week
- Run Kubernetes locally☆30,462Updated this week
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application…☆10,865Updated this week
- The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data☆42,113Updated this week
- Production-Grade Container Scheduling and Management☆115,405Updated this week
- Parallel computing with task scheduling☆13,237Updated last week
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.☆27,350Updated last week
- The Prometheus monitoring system and time series database.☆58,755Updated this week
- 🦍 The Cloud-Native API Gateway and AI Gateway.☆40,914Updated this week
- Build, Manage and Deploy AI/ML Systems☆8,838Updated last week
- AWS SDK for Python☆9,359Updated this week
- Lightweight Kubernetes☆29,733Updated last week
- The Kubernetes Package Manager☆27,935Updated this week
- The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Pro…☆68,216Updated this week
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and…☆10,359Updated this week
- Terraform enables you to safely and predictably create, change, and improve infrastructure. It is a source-available tool that codifies A…☆45,261Updated this week
- A time-series database for high-performance real-time analytics packaged as a Postgres extension☆19,164Updated this week
- Apache Druid: a high performance real-time analytics database.☆13,718Updated this week
- Open source platform for the machine learning lifecycle☆20,638Updated this week
- A tool for secrets management, encryption as a service, and privileged access management☆32,458Updated this week
- Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects,…☆45,532Updated this week
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics☆15,481Updated this week
- An orchestration platform for the development, production, and observation of data assets.☆13,247Updated this week