iterative / dvc-dataLinks
DVC's data management subsystem
☆18Updated this week
Alternatives and similar repositories for dvc-data
Users that are interested in dvc-data are comparing it to the libraries listed below
Sorting:
- ☆27Updated 2 years ago
- Benchmarks for DVC☆21Updated this week
- 🏷️ Git Tag Ops. Turn your Git repository into Artifact Registry or Model Registry.☆148Updated last week
- Accompanies the uncool MLOps workshop☆26Updated 3 years ago
- Seamlessly integrate numpy arrays into pydantic models.☆58Updated 2 years ago
- A mini dashboard to help find slow tests in pytest.☆80Updated 11 months ago
- SCM wrapper and fsspec filesystem for Git for use in DVC.☆22Updated this week
- 📈 Log and track ML metrics, parameters, models with Git and/or DVC☆177Updated this week
- Extremely lightweight compatibility layer between pandas and Polars☆41Updated last year
- A use case of a reproducible machine learning pipeline using Dask, DVC, and MLflow.☆23Updated 6 years ago
- The easiest way to integrate Kedro and Great Expectations☆52Updated 2 years ago
- A very simple "hello world" project for deploying Prefect 2 to a docker container on Google Compute Engine.☆11Updated 2 years ago
- A pytest plugin for regression testing and regenerating Jupyter Notebooks☆52Updated this week
- Placeholder for the opensource Grid AI components☆44Updated 3 years ago
- Tools for making Prefect work better for typical data science workflows☆18Updated 3 years ago
- Automate issue discovery for your projects against Lightning nightly and releases.☆46Updated last month
- Prefect integrations for working with Docker☆43Updated last year
- Run pytest against markdown files/docstrings.☆120Updated 8 months ago
- A collection of self-contained fsspec-based filesystems☆17Updated this week
- dask-pytorch-ddp is a Python package that makes it easy to train PyTorch models on dask clusters using distributed data parallel.☆59Updated 4 years ago
- Fast approximate joins on string columns for polars dataframes.☆13Updated 8 months ago
- Distributed persistent Task Queue running on Dask☆38Updated 2 years ago
- Cluster tools for running Dask on Databricks☆14Updated last year
- Kedro-Accelerator speeds up pipelines by parallelizing I/O in the background.☆35Updated 3 years ago
- simple, flexible, offline capable, cloud storage with a Python path-like interface☆173Updated 2 months ago
- An MkDocs extension to generate documentation for Typer command line applications☆30Updated 2 years ago
- Extension to hypothesis for testing numpy general universal functions☆39Updated 4 years ago
- Feature engineering library that helps you keep track of feature dependencies, documentation and schema☆28Updated 3 years ago
- Convert pyproject.toml to environment.yaml☆131Updated 2 years ago
- Clean up the public namespace of your package!☆56Updated last month