pditommaso / awesome-pipeline
A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
β6,346Updated last month
Alternatives and similar repositories for awesome-pipeline:
Users that are interested in awesome-pipeline are comparing it to the libraries listed below
- A curated list of awesome ETL frameworks, libraries, and software.β3,390Updated 9 months ago
- Repository for the CWL standards. Use https://cwl.discourse.group/ for support πβ1,463Updated 4 months ago
- Parallel computing with task schedulingβ13,136Updated last week
- A next-generation curated knowledge sharing platform for data scientists and other technical professions.β5,513Updated 7 months ago
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflowβ2,080Updated last year
- Data-Centric Pipelines and Data Versioningβ6,222Updated 2 months ago
- Modin: Scale your Pandas workflows by changing a single line of codeβ10,116Updated last week
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visβ¦β18,234Updated 2 months ago
- Curated list of resources about Apache Airflowβ3,770Updated 8 months ago
- the portable Python dataframe libraryβ5,705Updated this week
- a curated list of awesome streaming frameworks, applications, etcβ2,801Updated last month
- π Parameterize, execute, and analyze notebooksβ6,137Updated 2 weeks ago
- Build, Manage and Deploy AI/ML Systemsβ8,742Updated this week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per sβ¦β8,371Updated 6 months ago
- Extract Transform Load for Python 3.5+β1,590Updated last year
- NumPy and Pandas interface to Big Dataβ3,199Updated last year
- Jupyter Notebooks as Markdown Documents, Julia, Python or R scriptsβ6,814Updated this week
- A curated list of awesome Jupyter projects, libraries and resourcesβ4,209Updated this week
- A curated list of awesome open source workflow enginesβ6,987Updated 3 months ago
- Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrowβ2,744Updated 3 years ago
- Docker Apache Airflowβ3,803Updated 2 years ago
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning modelsβ4,503Updated this week
- π¦ Data Versioning and ML Experimentsβ14,402Updated this week
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applicationβ¦β10,685Updated this week
- A scalable, efficient, cross-platform (Linux/macOS) and easy-to-use workflow engine in pure Python.β909Updated this week
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interactingβ¦β4,557Updated 3 weeks ago
- Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.β6,183Updated this week
- The Open Source Feature Store for AI/MLβ5,975Updated this week
- π© Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library.β¦β6,601Updated 2 months ago
- A functional standard library for Python.β4,839Updated 3 months ago