treeverse / dvc-data
DVC's data management subsystem
☆18 · Updated 2 weeks ago
Alternatives and similar repositories for dvc-data
Users who are interested in dvc-data are comparing it to the libraries listed below.
- ☆27 · Updated 3 years ago
- 🏷️ Git Tag Ops. Turn your Git repository into Artifact Registry or Model Registry. ☆158 · Updated 3 weeks ago
- 📈 Log and track ML metrics, parameters, models with Git and/or DVC ☆184 · Updated 2 weeks ago
- Kedro-Accelerator speeds up pipelines by parallelizing I/O in the background. ☆36 · Updated 3 years ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM ☆86 · Updated last year
- Kedro Plugin to support running workflows on Kubeflow Pipelines ☆56 · Updated 6 months ago
- Benchmarks for DVC ☆21 · Updated 2 weeks ago
- Seamlessly integrate numpy arrays into pydantic models. ☆59 · Updated 3 years ago
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure, and more... ☆145 · Updated 2 months ago
- Accompanies the uncool MLOps workshop ☆26 · Updated 3 years ago
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊 ☆79 · Updated last year
- Deploy MLflow with HTTP basic authentication using Docker ☆104 · Updated 2 months ago
- Extremely lightweight compatibility layer between pandas and Polars ☆41 · Updated last year
- The easiest way to integrate Kedro and Great Expectations ☆54 · Updated 3 years ago
- Run pytest against markdown files/docstrings. ☆154 · Updated 2 months ago
- Dataset registry DVC project ☆85 · Updated last year
- dask-pytorch-ddp is a Python package that makes it easy to train PyTorch models on dask clusters using distributed data parallel. ☆59 · Updated 4 years ago
- spock is a framework that helps manage complex parameter configurations during research and development of Python applications ☆142 · Updated 2 years ago
- The DBT of ML: Aligned describes data dependencies in ML systems and reduces technical data debt ☆60 · Updated this week
- A plugin for Flake8 that checks pandas code ☆170 · Updated 2 years ago
- ☆31 · Updated 2 years ago
- Coming soon ☆62 · Updated 2 years ago
- First-party plugins maintained by the Kedro team. ☆112 · Updated this week
- A mini dashboard to help find slow tests in pytest. ☆83 · Updated last year
- Dvc + Streamlit = ❤️ ☆40 · Updated 2 years ago
- Capture all information throughout your model's development in a reproducible way and tie results directly to the model code! ☆139 · Updated 2 weeks ago
- Model Agnostic Confidence Estimator (MACEST) - A Python library for calibrating Machine Learning models' confidence scores ☆100 · Updated 3 months ago
- A proof-of-concept for a RAG to query the scikit-learn documentation ☆28 · Updated 4 months ago
- 💫 PyScaffold extension for data-science projects ☆159 · Updated last month
- A use case of a reproducible machine learning pipeline using Dask, DVC, and MLflow. ☆22 · Updated 6 years ago