treeverse / dvc-data
DVC's data management subsystem
☆18Updated last week
Alternatives and similar repositories for dvc-data
Users interested in dvc-data are comparing it to the libraries listed below.
- ☆27Updated 3 years ago
- Benchmarks for DVC☆22Updated this week
- 🏷️ Git Tag Ops. Turn your Git repository into Artifact Registry or Model Registry.☆158Updated last month
- 📈 Log and track ML metrics, parameters, models with Git and/or DVC☆185Updated last week
- Seamlessly integrate numpy arrays into pydantic models.☆59Updated 3 years ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆57Updated 7 months ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆86Updated 2 years ago
- The easiest way to integrate Kedro and Great Expectations☆54Updated 3 years ago
- Extremely lightweight compatibility layer between pandas and Polars☆41Updated last year
- Kedro-Accelerator speeds up pipelines by parallelizing I/O in the background.☆36Updated 3 years ago
- simple, flexible, offline capable, cloud storage with a Python path-like interface☆175Updated 9 months ago
- A mini dashboard to help find slow tests in pytest.☆83Updated last year
- Decorators that log stats.☆115Updated 10 months ago
- Typed wrappers over pandas DataFrames with schema validation☆102Updated 2 years ago
- Accompanies the uncool MLOps workshop☆26Updated 3 years ago
- The DBT of ML: Aligned describes data dependencies in ML systems and reduces technical data debt☆60Updated last month
- DVC + Streamlit = ❤️☆40Updated 2 years ago
- A simple extension for Jupyter Notebook and Jupyter Lab to beautify Python code automatically using Black. Fork of dnanhkhoa/nb_black.☆59Updated last year
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure and more...☆145Updated 3 months ago
- A use case of a reproducible machine learning pipeline using Dask, DVC, and MLflow.☆22Updated 6 years ago
- A proof-of-concept for a RAG to query the scikit-learn documentation☆29Updated 5 months ago
- Woodwork is a Python library that provides robust methods for managing and communicating data typing information.☆155Updated 4 months ago
- spock is a framework that helps manage complex parameter configurations during research and development of Python applications☆142Updated 2 years ago
- Run pytest against markdown files/docstrings.☆157Updated 2 months ago
- Feature engineering library that helps you keep track of feature dependencies, documentation and schema☆28Updated 4 years ago
- ☆30Updated 4 years ago
- Test suite for Python array API standard compliance☆70Updated last week
- Deploy MLflow with HTTP basic authentication using Docker☆104Updated 3 weeks ago
- Time based splits for cross validation☆39Updated this week
- 📖 Documentation for Nebari☆21Updated this week