pditommaso / awesome-pipelineView external linksLinks
A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
β6,525Feb 9, 2026Updated last week
Alternatives and similar repositories for awesome-pipeline
Users that are interested in awesome-pipeline are comparing it to the libraries listed below
Sorting:
- A curated list of awesome ETL frameworks, libraries, and software.β3,518Jul 23, 2024Updated last year
- Repository for the CWL standards. Use https://cwl.discourse.group/ for support πβ1,477Jan 8, 2026Updated last month
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visβ¦β18,662Updated this week
- A curated list of awesome open source workflow enginesβ7,659Updated this week
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflowsβ44,274Updated this week
- An orchestration platform for the development, production, and observation of data assets.β14,930Feb 9, 2026Updated last week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.β21,607Updated this week
- A DSL for data-driven computational pipelinesβ3,299Updated this week
- a curated list of awesome streaming frameworks, applications, etcβ2,953Feb 9, 2026Updated last week
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflowβ2,086Dec 15, 2023Updated 2 years ago
- Parallel computing with task schedulingβ13,738Feb 5, 2026Updated last week
- Data-Centric Pipelines and Data Versioningβ6,287Feb 3, 2025Updated last year
- Build, Manage and Deploy AI/ML Systemsβ9,753Updated this week
- π Parameterize, execute, and analyze notebooksβ6,373Jan 5, 2026Updated last month
- π¦ Data Versioning and ML Experimentsβ15,367Updated this week
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering andβ¦β10,756Updated this week
- A curated list of nextflow based pipelinesβ621Jun 23, 2025Updated 7 months ago
- Always know what to expect from your data.β11,133Feb 9, 2026Updated last week
- A WDL, CWL and Python API supporting easy-to-use workflow engine. It is scalable, efficient and cross-platform (Linux/macOS).β927Updated this week
- Curated list of resources about Apache Airflowβ3,885Jan 30, 2026Updated 2 weeks ago
- Apache Superset is a Data Visualization and Data Exploration Platformβ70,505Updated this week
- Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.β6,715Feb 10, 2026Updated last week
- A curated list of data engineering tools for software developersβ8,281Feb 10, 2026Updated last week
- Modin: Scale your Pandas workflows by changing a single line of codeβ10,357Updated this week
- A curated list of awesome Bioinformatics libraries and software.β3,852Mar 21, 2025Updated 10 months ago
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applicationβ¦β12,250Updated this week
- This is the development home of the workflow management system Snakemake. For general information, seeβ2,703Feb 5, 2026Updated last week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.β41,259Updated this week
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.β28,221Updated this week
- A curated list of amazingly awesome open source sysadmin resources inspired by Awesome PHP.β24,228Mar 26, 2024Updated last year
- A next-generation curated knowledge sharing platform for data scientists and other technical professions.β5,541Sep 4, 2024Updated last year
- Workflow Engine for Kubernetesβ16,451Updated this week
- Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.β28,108Feb 1, 2026Updated 2 weeks ago
- Python Stream Processingβ6,836Jul 27, 2024Updated last year
- A curated list of awesome big data frameworks, ressources and other awesomeness.β14,225Feb 5, 2026Updated last week
- the portable Python dataframe libraryβ6,397Updated this week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per sβ¦β8,468Feb 5, 2026Updated last week
- A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learningβ20,123Feb 9, 2026Updated last week
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interactingβ¦β4,738Feb 9, 2026Updated last week