Minyus / Python_Packages_for_Pipeline_Workflow
This article compares open-source Python packages for pipeline/workflow development: Airflow, Luigi, Gokart, Metaflow, Kedro, PipelineX.
☆57 Updated 4 years ago
Alternatives and similar repositories for Python_Packages_for_Pipeline_Workflow
Users who are interested in Python_Packages_for_Pipeline_Workflow are comparing it to the libraries listed below.
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save. ☆77 Updated last year
- Kedro Wings automatically creates catalog entries to simplify Kedro pipeline writing. See the video here: https://www.youtube.com/watch?v… ☆22 Updated 2 years ago
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included ☆28 Updated 3 years ago
- PipelineX: Python package to build ML pipelines for experimentation with Kedro, MLflow, and more ☆228 Updated last year
- The easiest way to integrate Kedro and Great Expectations ☆52 Updated 2 years ago
- Summarise and explore Pandas DataFrames ☆98 Updated 4 years ago
- 🐍 Material for PyData Global 2021 Presentation: Effective Testing for Machine Learning Projects ☆81 Updated 3 years ago
- Docker image for high-performance Machine Learning web applications. With Uvicorn managed by Gunicorn in Python 3.7 and 3.6, using Conda,… ☆67 Updated 2 years ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM ☆84 Updated last year
- kedro cli plugin for generating a static kedro viz site (html, css, js) that can be deployed on many serverless tools. ☆27 Updated 2 years ago
- SciKIt-learn Pipeline in PAndas ☆42 Updated last year
- Automated Exploratory Data Analysis. Simplifying Data Exploration ☆36 Updated 5 years ago
- 💫 PyScaffold extension for data-science projects ☆159 Updated 2 months ago
- An abstraction layer for parameter tuning ☆35 Updated 9 months ago
- ByteHub: making feature stores simple ☆60 Updated 4 years ago
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr… ☆57 Updated 4 years ago
- The fast.ai data ethics course ☆16 Updated 2 years ago
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊 ☆79 Updated 9 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆113 Updated last year
- MLOps Python Library ☆119 Updated 3 years ago
- Templates for your Kedro projects. ☆76 Updated this week
- vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distr… ☆121 Updated 5 months ago
- big data technologies comparisons for cleaning, manipulating and generally wrangling data in purpose of analysis and machine learning. ☆65 Updated 5 years ago
- ☄️ Parallel and distributed training with spaCy and Ray ☆54 Updated last year
- Sample projects using Ploomber. ☆86 Updated last year
- Projects developed by Domino's R&D team ☆76 Updated 3 years ago
- Docker images for dask ☆241 Updated last month
- JupyterHub extension for ContainDS Dashboards ☆201 Updated 10 months ago