PeterFogh / dvc_dask_use_case
A use case of a reproducible machine learning pipeline using Dask, DVC, and MLflow.
☆23 Updated 5 years ago
Related projects
Alternatives and complementary repositories for dvc_dask_use_case
- DVC + Streamlit = ❤️ ☆40 Updated last year
- A walkthrough of essential DVC features (including tutorial text as well as a working environment). ☆17 Updated 2 years ago
- 🏷️ Git Tag Ops. Turn your Git repository into an Artifact Registry or Model Registry. ☆142 Updated last week
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM ☆82 Updated 10 months ago
- ☆28 Updated 2 years ago
- DVC support for Airflow workflows ☆6 Updated 2 years ago
- ☆20 Updated 2 years ago
- 📈 Log and track ML metrics, parameters, models with Git and/or DVC ☆167 Updated this week
- A collection of helpers for Jupyter/IPython ☆47 Updated 3 years ago
- A proof of concept library for generating and running machine learning model tests ☆13 Updated 4 years ago
- Easy-to-use util for profiling in production ☆11 Updated last year
- A Python implementation of LightFM, a hybrid recommendation algorithm. ☆14 Updated 7 years ago
- Using MLflow with a PostgreSQL Database Tracking URI and a Minio Artifact URI, and MLflow Registry ☆12 Updated 4 years ago
- Benchmarks for DVC ☆20 Updated this week
- A tiny Catalyst-like experiment runner framework on top of micrograd. ☆52 Updated 3 years ago
- Public repository for versioning machine learning data ☆42 Updated 2 years ago
- A logical, reasonably standardized, but flexible project structure for doing and sharing data science work. ☆26 Updated last year
- mlctl is the control plane for MLOps. It provides a CLI and a Python SDK for supporting key operations related to MLOps, such as "model t… ☆25 Updated 3 years ago
- A barebones (Distil)BERT pipeline for token classification tasks driven by Catalyst ☆13 Updated 5 years ago
- CraftML is a RESTful web service for easy pipeline creation without code. ☆13 Updated 3 years ago
- Data sets and ML models versioning example from DVC get started ☆9 Updated 5 months ago
- A library helping to gather stats and run checks during training deep learning models with PyTorch ☆36 Updated 2 years ago
- Catalyst.Segmentation ☆28 Updated 3 years ago
- Convenient DL serving ☆72 Updated 3 years ago
- dask-pytorch-ddp is a Python package that makes it easy to train PyTorch models on Dask clusters using distributed data parallel. ☆57 Updated 3 years ago
- Altair backend for pandas plotting ☆102 Updated 3 years ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines ☆53 Updated 2 months ago
- Create, visualize, run & benchmark DVC pipelines in Python & Jupyter notebooks. ☆48 Updated this week