pyDVL is a library of stable implementations of algorithms for data valuation and influence function computation
☆146Apr 13, 2026Updated last month
Alternatives and similar repositories for pyDVL
Users that are interested in pyDVL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)☆101Feb 4, 2025Updated last year
- 💱 A curated list of data valuation (DV) to design your next data marketplace☆143Feb 20, 2025Updated last year
- [CVPR 2021] Scalability vs. Utility: Do We Have to Sacrifice One for the Other in Data Importance Quantification?☆34Dec 26, 2020Updated 5 years ago
- Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value (ICML 2023)☆21Jul 26, 2023Updated 2 years ago
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral)☆43Nov 10, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆81Oct 3, 2024Updated last year
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).☆54Jun 5, 2024Updated last year
- A library for calibrating classifiers and computing calibration metrics☆14Nov 28, 2022Updated 3 years ago
- Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature☆194Updated this week
- A fire-tested template for production grade python libraries and packages.☆18Jul 15, 2025Updated 10 months ago
- Data Shapley: Equitable Valuation of Data for Machine Learning☆295May 1, 2024Updated 2 years ago
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.☆123Mar 24, 2026Updated 2 months ago
- PyTorch reimplementation of computing Shapley values via Truncated Monte Carlo sampling from "What is your data worth? Equitable Valuatio…☆28Jan 21, 2022Updated 4 years ago
- Scalable data valuation using optimal transport (ICLR 2025)☆13Jul 15, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy☆88Feb 27, 2024Updated 2 years ago
- A simple PyTorch implementation of influence functions.☆95Jun 17, 2024Updated last year
- Official Repo for "Efficient task-specific data valuation for nearest neighbor algorithms"☆27Feb 21, 2020Updated 6 years ago
- Distributional Shapley: A Distributional Framework for Data Valuation☆31May 1, 2024Updated 2 years ago
- A Python Data Valuation Package☆35Feb 3, 2023Updated 3 years ago
- Measuring data importance over ML pipelines using the Shapley value.☆45Aug 26, 2025Updated 9 months ago
- Reinforcement learning (RL) is an effective method to find reasoning pathways in incomplete knowledge graphs (KGs). To overcome the chall…☆26Oct 13, 2024Updated last year
- Software design principles for machine learning applications☆380Aug 19, 2025Updated 9 months ago
- This is a PyTorch reimplementation of Influence Functions from the ICML2017 best paper: Understanding Black-box Predictions via Influence…☆347Oct 29, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code repository for the AISTATS 2021 paper "Towards Understanding the Optimal Behaviors of Deep Active Learning Algorithms"☆14Mar 20, 2021Updated 5 years ago
- ☆11May 11, 2022Updated 4 years ago
- Efficient approximation algorithms for Shapley Values in Horizontal Enterprise Federated Learning☆13May 5, 2020Updated 6 years ago
- Training-free data valuation on deep neural network applications. (ICML-2022)☆27Jul 13, 2022Updated 3 years ago
- Implementation of Influence Function approximations for differently sized ML models, using PyTorch☆18Sep 15, 2023Updated 2 years ago
- ☆32May 24, 2023Updated 3 years ago
- A python library to discover and mitigate biases in machine learning models and datasets☆20Jul 6, 2023Updated 2 years ago
- Data for "Datamodels: Predicting Predictions with Training Data"☆96May 25, 2023Updated 3 years ago
- ☆13Feb 24, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- AI Logging for Interpretability and Explainability🔬☆144Jun 7, 2024Updated last year
- Data Valuation without Training of a Model, submitted to ICLR'23☆22Dec 30, 2022Updated 3 years ago
- Pairwise Difference Learning (PDL) is a meta-learning framework that leverages pairwise differences to transform multiclass problems into…☆22Aug 27, 2025Updated 9 months ago
- Reading list for "The Shapley Value in Machine Learning" (JCAI 2022)☆154Aug 8, 2022Updated 3 years ago
- Mixture of Decision Trees for Interpretable Machine Learning☆11Sep 2, 2021Updated 4 years ago
- ☆13Feb 25, 2022Updated 4 years ago
- Analysis of evidential models☆15Jun 22, 2023Updated 2 years ago