A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
β6,535Mar 5, 2026Updated this week
Alternatives and similar repositories for awesome-pipeline
Users that are interested in awesome-pipeline are comparing it to the libraries listed below
Sorting:
- A curated list of awesome ETL frameworks, libraries, and software.β3,521Jul 23, 2024Updated last year
- Repository for the CWL standards. Use https://cwl.discourse.group/ for support πβ1,480Jan 8, 2026Updated 2 months ago
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visβ¦β18,683Feb 28, 2026Updated last week
- A curated list of awesome open source workflow enginesβ7,703Feb 13, 2026Updated 3 weeks ago
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflowsβ44,510Updated this week
- An orchestration platform for the development, production, and observation of data assets.β15,049Mar 3, 2026Updated last week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.β21,782Updated this week
- A DSL for data-driven computational pipelinesβ3,316Updated this week
- a curated list of awesome streaming frameworks, applications, etcβ2,954Feb 9, 2026Updated last month
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflowβ2,086Dec 15, 2023Updated 2 years ago
- Parallel computing with task schedulingβ13,754Mar 2, 2026Updated last week
- Data-Centric Pipelines and Data Versioningβ6,287Feb 3, 2025Updated last year
- Build, Manage and Deploy AI/ML Systemsβ9,903Updated this week
- π Parameterize, execute, and analyze notebooksβ6,390Feb 27, 2026Updated last week
- π¦ Data Versioning and ML Experimentsβ15,421Mar 2, 2026Updated last week
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering andβ¦β10,771Feb 26, 2026Updated last week
- A curated list of nextflow based pipelinesβ622Jun 23, 2025Updated 8 months ago
- Always know what to expect from your data.β11,224Updated this week
- A WDL, CWL and Python API supporting easy-to-use workflow engine. It is scalable, efficient and cross-platform (Linux/macOS).β926Mar 2, 2026Updated last week
- Curated list of resources about Apache Airflowβ3,896Jan 30, 2026Updated last month
- Apache Superset is a Data Visualization and Data Exploration Platformβ70,755Mar 2, 2026Updated last week
- Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.β6,833Updated this week
- A curated list of data engineering tools for software developersβ8,342Feb 21, 2026Updated 2 weeks ago
- Modin: Scale your Pandas workflows by changing a single line of codeβ10,363Feb 10, 2026Updated 3 weeks ago
- A curated list of awesome Bioinformatics libraries and software.β3,882Feb 28, 2026Updated last week
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applicationβ¦β12,345Updated this week
- This is the development home of the workflow management system Snakemake. For general information, seeβ2,718Feb 27, 2026Updated last week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.β41,617Updated this week
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.β28,255Mar 2, 2026Updated last week
- A curated list of amazingly awesome open source sysadmin resources inspired by Awesome PHP.β24,248Mar 26, 2024Updated last year
- A next-generation curated knowledge sharing platform for data scientists and other technical professions.β5,543Sep 4, 2024Updated last year
- Workflow Engine for Kubernetesβ16,495Updated this week
- Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.β28,140Mar 1, 2026Updated last week
- Python Stream Processingβ6,828Jul 27, 2024Updated last year
- the portable Python dataframe libraryβ6,440Updated this week
- A curated list of awesome big data frameworks, ressources and other awesomeness.β14,265Feb 5, 2026Updated last month
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per sβ¦β8,483Mar 1, 2026Updated last week
- A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learningβ20,200Mar 2, 2026Updated last week
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interactingβ¦β4,744Mar 1, 2026Updated last week