iterative / dvc-data
DVC's data management subsystem
☆18Updated this week
Alternatives and similar repositories for dvc-data
Users that are interested in dvc-data are comparing it to the libraries listed below
- ☆27Updated 2 years ago
- 🏷️ Git Tag Ops. Turn your Git repository into Artifact Registry or Model Registry.☆153Updated this week
- 📈 Log and track ML metrics, parameters, models with Git and/or DVC☆181Updated this week
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure, and more...☆145Updated this week
- Extremely lightweight compatibility layer between pandas and Polars☆41Updated last year
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆84Updated last year
- Seamlessly integrate numpy arrays into pydantic models.☆58Updated 2 years ago
- Accompanies the uncool MLOps workshop☆26Updated 3 years ago
- Decorators that log stats.☆115Updated 7 months ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆55Updated 3 months ago
- Coming soon☆62Updated last year
- Runs black on code cells in a Jupyter notebook☆50Updated 3 years ago
- A client library for executing notebooks. Formerly nbconvert's ExecutePreprocessor☆178Updated 2 months ago
- HiPlot fetcher for experiments logged with MLflow☆14Updated 3 years ago
- Benchmarks for DVC☆21Updated this week
- A simple extension for Jupyter Notebook and Jupyter Lab to beautify Python code automatically using Black. Fork of dnanhkhoa/nb_black.☆59Updated last year
- Simple, flexible, offline-capable cloud storage with a Python path-like interface☆174Updated 5 months ago
- spock is a framework that helps manage complex parameter configurations during research and development of Python applications☆139Updated last year
- 💫 PyScaffold extension for data-science projects☆159Updated this week
- DVC + Streamlit = ❤️☆40Updated last year
- A proof-of-concept for a RAG to query the scikit-learn documentation☆27Updated last month
- Kedro-Accelerator speeds up pipelines by parallelizing I/O in the background.☆36Updated 3 years ago
- Cluster tools for running Dask on Databricks☆14Updated last year
- A use case of a reproducible machine learning pipeline using Dask, DVC, and MLflow.☆23Updated 6 years ago
- A multi-tenant server for securely deploying and managing Dask clusters.☆142Updated this week
- RFC document, tooling and other content related to the dataframe API standard☆108Updated last year
- A plugin for Flake8 that checks pandas code☆170Updated 2 years ago
- The easiest way to integrate Kedro and Great Expectations☆54Updated 2 years ago
- ☆17Updated last year
- First-party plugins maintained by the Kedro team.☆106Updated this week