PeterFogh / dvc_dask_use_caseLinks
A use case of a reproducible machine learning pipeline using Dask, DVC, and MLflow.
β22Updated 6 years ago
Alternatives and similar repositories for dvc_dask_use_case
Users that are interested in dvc_dask_use_case are comparing it to the libraries listed below
Sorting:
- π·οΈ Git Tag Ops. Turn your Git repository into Artifact Registry or Model Registry.β158Updated last month
- π Log and track ML metrics, parameters, models with Git and/or DVCβ185Updated this week
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAMβ86Updated 2 years ago
- Fast Data Science, AKA fds, is a CLI for Data Scientists to version control data and code at once, by conveniently wrapping git and dvcβ391Updated last year
- β27Updated 3 years ago
- Dvc + Streamlit = β€οΈβ40Updated 2 years ago
- A walkthrough of essential DVC features (including tutorial text as well as a working environment).β17Updated 3 years ago
- π« PyScaffold extension for data-science projectsβ160Updated 3 weeks ago
- A proof of concept library for generating and running machine learning model testsβ13Updated 5 years ago
- A collection of helpers for Jupyter/IPythonβ48Updated 4 years ago
- A tiny Catalyst-like experiment runner framework on top of micrograd.β51Updated 5 years ago
- spock is a framework that helps manage complex parameter configurations during research and development of Python applicationsβ142Updated 2 years ago
- yogadl, the flexible data layerβ74Updated 2 years ago
- Practical active learning in pythonβ191Updated 3 years ago
- Deploy MLflow with HTTP basic authentication using Dockerβ104Updated 3 weeks ago
- Easy to use util for profiling in productionβ11Updated 2 years ago
- Inline data annotator for Jupyter notebooksβ181Updated 2 months ago
- Tools for MLflowβ40Updated 2 years ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stanβ¦β53Updated 2 years ago
- Hangar is version control for tensor data. Commit, branch, merge, revert, and collaborate in the data-defined software era.β205Updated 5 years ago
- dask-pytorch-ddp is a Python package that makes it easy to train PyTorch models on dask clusters using distributed data parallel.β59Updated 4 years ago
- HiPlot fetcher for experiments logged with MLflowβ14Updated 3 years ago
- Altair backend for pandas plottingβ104Updated 4 years ago
- Jupyter Widget for data annotationβ140Updated 3 years ago
- A simple wrapper over `pydot` and `graphviz` which fixes some sharp edgesβ63Updated 3 years ago
- GitHub Action for testing notebooksβ151Updated 4 years ago
- Export and import MLflow experiments, runs or registered modelsβ80Updated 3 years ago
- DVC's data management subsystemβ18Updated 2 weeks ago
- A library helping to gather stats and run checks during training deep learning models with Pytorchβ35Updated 3 years ago
- Home of the PipeGraph extension to Scikit-Learnβ24Updated 10 months ago