pditommaso/awesome-pipeline

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pditommaso/awesome-pipeline)

pditommaso / awesome-pipeline

A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin

☆6,605

Alternatives and similar repositories for awesome-pipeline

Users that are interested in awesome-pipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

pawl / awesome-etl
View on GitHub
A curated list of awesome ETL frameworks, libraries, and software.
☆3,579May 1, 2026Updated 2 months ago
common-workflow-language / common-workflow-language
View on GitHub
Repository for the CWL standards. Use https://cwl.discourse.group/ for support 😊
☆1,481Jan 8, 2026Updated 6 months ago
spotify / luigi
View on GitHub
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis…
☆18,754Jul 18, 2026Updated last week
meirwah / awesome-workflow-engines
View on GitHub
A curated list of awesome open source workflow engines
☆7,891Apr 6, 2026Updated 3 months ago
apache / airflow
View on GitHub
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
☆46,290Updated this week
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
nextflow-io / nextflow
View on GitHub
A DSL for data-driven computational pipelines
☆3,451Updated this week
dagster-io / dagster
View on GitHub
An orchestration platform for the development, production, and observation of data assets.
☆15,911Updated this week
manuzhang / awesome-streaming
View on GitHub
a curated list of awesome streaming frameworks, applications, etc
☆2,999Updated this week
PrefectHQ / prefect
View on GitHub
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
☆23,507Updated this week
nextflow-io / awesome-nextflow
View on GitHub
A curated list of nextflow based pipelines
☆629Jun 23, 2025Updated last year
mara / mara-pipelines
View on GitHub
A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
☆2,089Dec 15, 2023Updated 2 years ago
dask / dask
View on GitHub
Parallel computing with task scheduling
☆13,871Updated this week
pachyderm / pachyderm
View on GitHub
Data-Centric Pipelines and Data Versioning
☆6,299Feb 3, 2025Updated last year
DataBiosphere / toil
View on GitHub
A WDL, CWL and Python API supporting easy-to-use workflow engine. It is scalable, efficient and cross-platform (Linux/macOS).
☆937Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
danielecook / Awesome-Bioinformatics
View on GitHub
A curated list of awesome Bioinformatics libraries and software.
☆4,198Apr 7, 2026Updated 3 months ago
nteract / papermill
View on GitHub
📚 Parameterize, execute, and analyze notebooks
☆6,460Jul 6, 2026Updated 3 weeks ago
Netflix / metaflow
View on GitHub
Build, Manage and Deploy AI/ML Systems
☆10,201Updated this week
treeverse / dvc
View on GitHub
🦉 Data Versioning and ML Experiments
☆15,777Updated this week
kedro-org / kedro
View on GitHub
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and…
☆10,937Updated this week
snakemake / snakemake
View on GitHub
This is the development home of the workflow management system Snakemake. For general information, see
☆2,838Updated this week
fivetran / great_expectations
View on GitHub
Always know what to expect from your data.
☆11,680Updated this week
jghoman / awesome-apache-airflow
View on GitHub
Curated list of resources about Apache Airflow
☆3,922May 7, 2026Updated 2 months ago
kahun / awesome-sysadmin
View on GitHub
A curated list of amazingly awesome open source sysadmin resources inspired by Awesome PHP.
☆24,331Mar 26, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
apache / superset
View on GitHub
Apache Superset is a Data Visualization and Data Exploration Platform
☆74,026Updated this week
modin-project / modin
View on GitHub
Modin: Scale your Pandas workflows by changing a single line of code
☆10,392Feb 10, 2026Updated 5 months ago
igorbarinov / awesome-data-engineering
View on GitHub
A curated list of data engineering tools for software developers
☆8,902Jul 18, 2026Updated last week
flyteorg / flyte
View on GitHub
Dynamic, resilient AI orchestration. Coordinate data, models, and compute as you build AI workflows.
☆7,163Updated this week
pharmbio / sciluigi
View on GitHub
A light-weight wrapper library around Spotify's Luigi workflow library to make writing scientific workflows more fluent, flexible and mod…
☆335Dec 10, 2024Updated last year
openwdl / wdl
View on GitHub
Specification for the Workflow Description Language (WDL).
☆856Updated this week
dbt-labs / dbt-core
View on GitHub
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application…
☆13,534Updated this week
ray-project / ray
View on GitHub
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
☆43,374Updated this week
google / python-fire
View on GitHub
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.
☆28,211Jul 1, 2026Updated 3 weeks ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
robinhood / faust
View on GitHub
Python Stream Processing
☆6,823Jul 27, 2024Updated 2 years ago
argoproj / argo-workflows
View on GitHub
Workflow Engine for Kubernetes
☆16,854Updated this week
airbnb / knowledge-repo
View on GitHub
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
☆5,538Sep 4, 2024Updated last year
oxnr / awesome-bigdata
View on GitHub
A curated list of awesome big data frameworks, ressources and other awesomeness.
☆14,512May 19, 2026Updated 2 months ago
ibis-project / ibis
View on GitHub
the portable Python dataframe library
☆6,613Updated this week
EthicalML / awesome-production-machine-learning
View on GitHub
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
☆20,807Updated this week
getredash / redash
View on GitHub
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
☆28,717Updated this week