Minyus / Python_Packages_for_Pipeline_Workflow
This article compares open-source Python packages for pipeline/workflow development: Airflow, Luigi, Gokart, Metaflow, Kedro, PipelineX.
☆57Updated 5 years ago
Alternatives and similar repositories for Python_Packages_for_Pipeline_Workflow
Users who are interested in Python_Packages_for_Pipeline_Workflow are comparing it to the libraries listed below.
- PipelineX: Python package to build ML pipelines for experimentation with Kedro, MLflow, and more☆229Updated last year
- The easiest way to integrate Kedro and Great Expectations☆54Updated 2 years ago
- Examples of data science projects created with Kedro.☆173Updated 2 years ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.☆82Updated last year
- Start a data science project with modern tools☆200Updated 2 years ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆84Updated last year
- The goal of pandas-log is to provide feedback about basic pandas operations. It provides simple wrapper functions for the most common fun…☆216Updated 4 years ago
- Kedro Wings automatically creates catalog entries to simplify Kedro pipeline writing. See the video here: https://www.youtube.com/watch?v…☆22Updated 2 years ago
- ☆44Updated 2 years ago
- Summarise and explore Pandas DataFrames☆98Updated 5 years ago
- Kedro CLI plugin for generating a static Kedro-Viz site (HTML, CSS, JS) that can be deployed on many serverless tools.☆28Updated 2 years ago
- 🐍 Material for PyData Global 2021 Presentation: Effective Testing for Machine Learning Projects☆82Updated 3 years ago
- 💫 PyScaffold extension for data-science projects☆159Updated 2 weeks ago
- Kedro-Accelerator speeds up pipelines by parallelizing I/O in the background.☆36Updated 3 years ago
- Bulwark is a package for convenient property-based testing of pandas dataframes.☆226Updated 5 years ago
- A small Python library that can clump lists of data together.☆150Updated 3 years ago
- 🍦 Deployment tool for online machine learning models☆97Updated 3 years ago
- Dockerized ML Cookiecutter☆75Updated 2 years ago
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included☆28Updated 4 years ago
- Decorators that log stats.☆113Updated 5 months ago
- Model Agnostic Confidence Estimator (MACEST) - A Python library for calibrating Machine Learning models' confidence scores☆100Updated 3 months ago
- A bit of extra usability for SQLAlchemy v2.☆78Updated last year
- Cookiecutter template for data scientists working with Docker containers☆358Updated 3 years ago
- A library for recording and reading data in notebooks.☆294Updated 3 years ago
- Data Analysis Baseline Library☆133Updated 10 months ago
- Easy to use test framework for Jupyter Notebooks☆310Updated 3 years ago
- The easy way to write your own flavor of Pandas☆308Updated 2 weeks ago
- Comparison of big data technologies for cleaning, manipulating, and generally wrangling data for analysis and machine learning.☆65Updated 5 years ago
- Anovos - An open-source library for scalable feature engineering using Apache Spark☆76Updated 2 years ago
- A tool to deploy a mostly serverless MLflow tracking server on a GCP project with one command☆71Updated 3 months ago