A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
β6,588Apr 17, 2026Updated 2 months ago
Alternatives and similar repositories for awesome-pipeline
Users that are interested in awesome-pipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A curated list of awesome ETL frameworks, libraries, and software.β3,561May 1, 2026Updated last month
- Repository for the CWL standards. Use https://cwl.discourse.group/ for support πβ1,480Jan 8, 2026Updated 5 months ago
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visβ¦β18,743Updated this week
- A curated list of awesome open source workflow enginesβ7,841Apr 6, 2026Updated 2 months ago
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflowsβ45,788Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A DSL for data-driven computational pipelinesβ3,415Updated this week
- An orchestration platform for the development, production, and observation of data assets.β15,699Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.β22,598Updated this week
- a curated list of awesome streaming frameworks, applications, etcβ2,986Updated this week
- A curated list of nextflow based pipelinesβ627Jun 23, 2025Updated 11 months ago
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflowβ2,086Dec 15, 2023Updated 2 years ago
- Data-Centric Pipelines and Data Versioningβ6,291Feb 3, 2025Updated last year
- Parallel computing with task schedulingβ13,846Updated this week
- A WDL, CWL and Python API supporting easy-to-use workflow engine. It is scalable, efficient and cross-platform (Linux/macOS).β932Updated this week
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- π Parameterize, execute, and analyze notebooksβ6,450May 12, 2026Updated last month
- A curated list of awesome Bioinformatics libraries and software.β4,112Apr 7, 2026Updated 2 months ago
- π¦ Data Versioning and ML Experimentsβ15,675Jun 8, 2026Updated last week
- Build, Manage and Deploy AI/ML Systemsβ10,129Jun 11, 2026Updated last week
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering andβ¦β10,887Updated this week
- This is the development home of the workflow management system Snakemake. For general information, seeβ2,814Jun 11, 2026Updated last week
- Always know what to expect from your data.β11,556Updated this week
- Curated list of resources about Apache Airflowβ3,921May 7, 2026Updated last month
- A curated list of amazingly awesome open source sysadmin resources inspired by Awesome PHP.β24,306Mar 26, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Apache Superset is a Data Visualization and Data Exploration Platformβ73,298Updated this week
- Modin: Scale your Pandas workflows by changing a single line of codeβ10,388Feb 10, 2026Updated 4 months ago
- A curated list of data engineering tools for software developersβ8,741Updated this week
- Dynamic, resilient AI orchestration. Coordinate data, models, and compute as you build AI workflows.β7,088Updated this week
- Specification for the Workflow Description Language (WDL).β852Updated this week
- A light-weight wrapper library around Spotify's Luigi workflow library to make writing scientific workflows more fluent, flexible and modβ¦β335Dec 10, 2024Updated last year
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applicationβ¦β12,990Updated this week
- Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.β28,201Apr 1, 2026Updated 2 months ago
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.β42,855Updated this week
- End-to-end encrypted cloud storage - Proton Drive β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Python Stream Processingβ6,824Jul 27, 2024Updated last year
- A next-generation curated knowledge sharing platform for data scientists and other technical professions.β5,532Sep 4, 2024Updated last year
- Workflow Engine for Kubernetesβ16,763Updated this week
- the portable Python dataframe libraryβ6,573Updated this week
- A curated list of awesome big data frameworks, ressources and other awesomeness.β14,435May 19, 2026Updated 3 weeks ago
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.β28,636Jun 1, 2026Updated 2 weeks ago
- A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learningβ20,635Jun 4, 2026Updated last week