A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
β6,559Mar 16, 2026Updated 2 weeks ago
Alternatives and similar repositories for awesome-pipeline
Users that are interested in awesome-pipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A curated list of awesome ETL frameworks, libraries, and software.β3,529Mar 7, 2026Updated 3 weeks ago
- Repository for the CWL standards. Use https://cwl.discourse.group/ for support πβ1,482Jan 8, 2026Updated 2 months ago
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visβ¦β18,705Mar 18, 2026Updated last week
- A curated list of awesome open source workflow enginesβ7,748Feb 13, 2026Updated last month
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflowsβ44,790Updated this week
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A DSL for data-driven computational pipelinesβ3,336Updated this week
- An orchestration platform for the development, production, and observation of data assets.β15,134Mar 20, 2026Updated last week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.β21,969Updated this week
- a curated list of awesome streaming frameworks, applications, etcβ2,962Feb 9, 2026Updated last month
- A curated list of nextflow based pipelinesβ622Jun 23, 2025Updated 9 months ago
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflowβ2,084Dec 15, 2023Updated 2 years ago
- Parallel computing with task schedulingβ13,778Updated this week
- Data-Centric Pipelines and Data Versioningβ6,290Feb 3, 2025Updated last year
- A WDL, CWL and Python API supporting easy-to-use workflow engine. It is scalable, efficient and cross-platform (Linux/macOS).β927Mar 19, 2026Updated last week
- End-to-end encrypted email - Proton Mail β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- π Parameterize, execute, and analyze notebooksβ6,414Mar 16, 2026Updated last week
- A curated list of awesome Bioinformatics libraries and software.β3,910Feb 28, 2026Updated last month
- Build, Manage and Deploy AI/ML Systemsβ9,973Updated this week
- π¦ Data Versioning and ML Experimentsβ15,482Updated this week
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering andβ¦β10,799Updated this week
- This is the development home of the workflow management system Snakemake. For general information, seeβ2,734Updated this week
- Always know what to expect from your data.β11,301Updated this week
- Curated list of resources about Apache Airflowβ3,894Jan 30, 2026Updated 2 months ago
- A curated list of amazingly awesome open source sysadmin resources inspired by Awesome PHP.β24,268Mar 26, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Apache Superset is a Data Visualization and Data Exploration Platformβ71,626Updated this week
- Modin: Scale your Pandas workflows by changing a single line of codeβ10,364Feb 10, 2026Updated last month
- A curated list of data engineering tools for software developersβ8,402Feb 21, 2026Updated last month
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applicationβ¦β12,475Updated this week
- Specification for the Workflow Description Language (WDL).β847Mar 18, 2026Updated last week
- A light-weight wrapper library around Spotify's Luigi workflow library to make writing scientific workflows more fluent, flexible and modβ¦β335Dec 10, 2024Updated last year
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.β41,877Updated this week
- Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.β28,162Mar 1, 2026Updated 3 weeks ago
- Python Stream Processingβ6,823Jul 27, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways β’ AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- A next-generation curated knowledge sharing platform for data scientists and other technical professions.β5,545Sep 4, 2024Updated last year
- Workflow Engine for Kubernetesβ16,572Updated this week
- A curated list of awesome big data frameworks, ressources and other awesomeness.β14,294Feb 5, 2026Updated last month
- the portable Python dataframe libraryβ6,466Updated this week
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.β28,307Mar 19, 2026Updated last week
- A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learningβ20,277Mar 23, 2026Updated last week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per sβ¦β8,498Mar 1, 2026Updated 3 weeks ago