PeterFogh / dvc_dask_use_caseLinks
A use case of a reproducible machine learning pipeline using Dask, DVC, and MLflow.
☆22Updated 6 years ago
Alternatives and similar repositories for dvc_dask_use_case
Users that are interested in dvc_dask_use_case are comparing it to the libraries listed below
Sorting:
- 📈 Log and track ML metrics, parameters, models with Git and/or DVC☆184Updated last week
- 🏷️ Git Tag Ops. Turn your Git repository into Artifact Registry or Model Registry.☆155Updated last week
- Dvc + Streamlit = ❤️☆40Updated 2 years ago
- ☆27Updated 3 years ago
- Tools for MLflow☆40Updated last year
- Easy to use util for profiling in production☆11Updated 2 years ago
- yogadl, the flexible data layer☆74Updated 2 years ago
- A walkthrough of essential DVC features (including tutorial text as well as a working environment).☆17Updated 3 years ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆86Updated last year
- A library helping to gather stats and run checks during training deep learning models with Pytorch☆35Updated 3 years ago
- Simple utility that retrieves current Jupyter notebook filename or path, when run from Jupyter notebook.☆64Updated 4 months ago
- spock is a framework that helps manage complex parameter configurations during research and development of Python applications☆141Updated 2 years ago
- 💫 PyScaffold extension for data-science projects☆158Updated 2 weeks ago
- A tiny Catalyst-like experiment runner framework on top of micrograd.☆51Updated 4 years ago
- DVC's data management subsystem☆18Updated last week
- A proof of concept library for generating and running machine learning model tests☆13Updated 5 years ago
- Deploy MLflow with HTTP basic authentication using Docker☆104Updated last month
- dask-pytorch-ddp is a Python package that makes it easy to train PyTorch models on dask clusters using distributed data parallel.☆59Updated 4 years ago
- Practical active learning in python☆191Updated 3 years ago
- Kedro-Accelerator speeds up pipelines by parallelizing I/O in the background.☆36Updated 3 years ago
- Inline data annotator for Jupyter notebooks☆180Updated last week
- COVID-19 Python Flask API with real-time data from Wikipedia☆22Updated 3 years ago
- Fast Data Science, AKA fds, is a CLI for Data Scientists to version control data and code at once, by conveniently wrapping git and dvc☆391Updated last year
- Allow parsing Russian receipts☆53Updated 5 years ago
- Clean up the public namespace of your package!☆57Updated 6 months ago
- CraftML is a restful web service for easy pipeline creation without code.☆13Updated 4 years ago
- A lightweight wrapper for PyTorch that provides a simple declarative API for context switching between devices, distributed modes, mixed-…☆67Updated 2 years ago
- A collection of inference modules for fastai2☆90Updated 3 years ago
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊☆79Updated last year
- Altair backend for pandas plotting☆104Updated 4 years ago