apache / airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
☆39,674Updated this week
Alternatives and similar repositories for airflow:
Users that are interested in airflow are comparing it to the libraries listed below
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis…☆18,224Updated 2 months ago
- Docker Apache Airflow☆3,803Updated 2 years ago
- An orchestration platform for the development, production, and observation of data assets.☆12,954Updated this week
- Apache Superset is a Data Visualization and Data Exploration Platform☆65,726Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.☆18,950Updated this week
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application…☆10,655Updated this week
- Apache Druid: a high performance real-time analytics database.☆13,675Updated this week
- Apache Spark - A unified analytics engine for large-scale data processing☆40,957Updated this week
- Curated list of resources about Apache Airflow☆3,768Updated 7 months ago
- Python Development Workflow for Humans.☆25,035Updated this week
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.☆27,223Updated this week
- Always know what to expect from your data.☆10,324Updated this week
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics☆15,252Updated this week
- Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.☆27,571Updated this week
- Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.☆6,475Updated this week
- Open source platform for the machine learning lifecycle☆20,155Updated this week
- Python packaging and dependency management made easy☆33,008Updated this week
- The Python micro framework for building web applications.☆69,340Updated 2 weeks ago
- Blazing fast, instant realtime GraphQL APIs on all your data with fine grained access control, also trigger webhooks on database events.☆31,471Updated this week
- Apache Beam is a unified programming model for Batch and Streaming data processing.☆8,081Updated this week
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆7,947Updated this week
- A modular SQL linter and auto-formatter with support for multiple dialects and templated code.☆8,788Updated this week
- Parallel computing with task scheduling☆13,125Updated this week
- Connect, secure, control, and observe services.☆36,741Updated this week
- 🥧 HTTPie CLI — modern, user-friendly command-line HTTP client for the API era. JSON support, colors, sessions, downloads, plugins & mor…☆35,299Updated 4 months ago
- Developer-first error tracking and performance monitoring☆40,576Updated this week
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and…☆10,272Updated this week
- 💫 Industrial-strength Natural Language Processing (NLP) in Python☆31,410Updated last week
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…☆4,556Updated 2 weeks ago
- The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lak…☆17,860Updated this week