iterative / dvc-dataLinks
DVC's data management subsystem
☆18Updated this week
Alternatives and similar repositories for dvc-data
Users that are interested in dvc-data are comparing it to the libraries listed below
Sorting:
- ☆27Updated 2 years ago
- Benchmarks for DVC☆21Updated last week
- 🏷️ Git Tag Ops. Turn your Git repository into Artifact Registry or Model Registry.☆153Updated this week
- Seamlessly integrate numpy arrays into pydantic models.☆58Updated 2 years ago
- Extremely lightweight compatibility layer between pandas and Polars☆41Updated last year
- 📈 Log and track ML metrics, parameters, models with Git and/or DVC☆181Updated this week
- A mini dashboard to help find slow tests in pytest.☆83Updated last year
- simple, flexible, offline capable, cloud storage with a Python path-like interface☆174Updated 4 months ago
- Coming soon☆62Updated last year
- The DBT of ML, as Aligned describes data dependencies in ML systems, and reduce technical data debt☆60Updated last week
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...☆144Updated 2 weeks ago
- The easiest way to integrate Kedro and Great Expectations☆54Updated 2 years ago
- Run pytest against markdown files/docstrings.☆128Updated last month
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆84Updated last year
- Synchronicity lets you interoperate with asynchronous Python APIs.☆123Updated last month
- Test suite for Python array API standard compliance☆68Updated last month
- spock is a framework that helps manage complex parameter configurations during research and development of Python applications☆139Updated last year
- A pytest plugin to create benchmarks☆102Updated last month
- Feature engineering library that helps you keep track of feature dependencies, documentation and schema☆28Updated 3 years ago
- ☆17Updated last year
- Accompanies the uncool MLOps workshop☆26Updated 3 years ago
- A plugin for Flake8 that checks pandas code☆170Updated 2 years ago
- A multi-tenant server for securely deploying and managing Dask clusters.☆142Updated 2 weeks ago
- dask-pytorch-ddp is a Python package that makes it easy to train PyTorch models on dask clusters using distributed data parallel.☆59Updated 4 years ago
- A data modelling layer built on top of polars and pydantic☆198Updated 2 years ago
- 📖 Documentation for Nebari☆16Updated last week
- Decorators that logs stats.☆114Updated 6 months ago
- A use case of a reproducible machine learning pipeline using Dask, DVC, and MLflow.☆23Updated 6 years ago
- Lint Cython files☆88Updated last week
- IbisML is a library for building scalable ML pipelines using Ibis.☆115Updated last month