Minyus / Python_Packages_for_Pipeline_Workflow
This article compares open-source Python packages for pipeline/workflow development: Airflow, Luigi, Gokart, Metaflow, Kedro, PipelineX.
β57Updated 4 years ago
Alternatives and similar repositories for Python_Packages_for_Pipeline_Workflow:
Users that are interested in Python_Packages_for_Pipeline_Workflow are comparing it to the libraries listed below
- PipelineX: Python package to build ML pipelines for experimentation with Kedro, MLflow, and moreβ228Updated last year
- π¦ Deployment tool for online machine learning modelsβ97Updated 2 years ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAMβ84Updated last year
- A scikit-learn compatible estimator based on business-rules with interactive dashboard includedβ28Updated 3 years ago
- Automated Data Science and Machine Learning library to optimize workflow.β104Updated 2 years ago
- The easiest way to integrate Kedro and Great Expectationsβ53Updated 2 years ago
- π Material for PyData Global 2021 Presentation: Effective Testing for Machine Learning Projectsβ81Updated 3 years ago
- Summarise and explore Pandas DataFramesβ98Updated 4 years ago
- vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distrβ¦β120Updated 3 months ago
- Decorators that logs stats.β110Updated last month
- Automated Exploratory Data Analysis. Simplifying Data Explorationβ34Updated 4 years ago
- Kedro Wings automatically creates catalog entries to simplify Kedro pipeline writing. See the video here: https://www.youtube.com/watch?vβ¦β22Updated 2 years ago
- Dockerized ML Cookiecutterβ73Updated 2 years ago
- β43Updated 2 years ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stanβ¦β53Updated 2 years ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.β77Updated last year
- Kedro-Accelerator speeds up pipelines by parallelizing I/O in the background.β35Updated 3 years ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projectsβ106Updated last year
- Primrose modeling framework for simple production modelsβ32Updated last year
- An abstraction layer for parameter tuningβ35Updated 7 months ago
- The goal of pandas-log is to provide feedback about basic pandas operations. It provides simple wrapper functions for the most common funβ¦β216Updated 3 years ago
- kedro cli plugin for generating a static kedro viz site (html, css, js) that can be deployed on many serverless tools.β27Updated 2 years ago
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn frβ¦β57Updated 3 years ago
- Buy Till You Die and Customer Lifetime Value statistical models in Python.β116Updated 11 months ago
- Pipeline components that support partial_fit.β46Updated 9 months ago
- A collection of machine learning model cards and datasheets.β75Updated 10 months ago
- Data Analysis Baseline Libraryβ131Updated 6 months ago
- Projects developed by Domino's R&D teamβ76Updated 3 years ago
- JupyterHub extension for ContainDS Dashboardsβ202Updated 8 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withouβ¦β113Updated last year