pyDVL is a library of stable implementations of algorithms for data valuation and influence function computation
☆146Apr 13, 2026Updated 2 months ago
Alternatives and similar repositories for pyDVL
Users that are interested in pyDVL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)☆101Feb 4, 2025Updated last year
- [CVPR 2021] Scalability vs. Utility: Do We Have to Sacrifice One for the Other in Data Importance Quantification?☆34Dec 26, 2020Updated 5 years ago
- Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value (ICML 2023)☆21Jul 26, 2023Updated 2 years ago
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral)☆43Nov 10, 2022Updated 3 years ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆81Oct 3, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).☆54Jun 5, 2024Updated 2 years ago
- A library for calibrating classifiers and computing calibration metrics☆14Nov 28, 2022Updated 3 years ago
- This is an official repository for "Performance Scaling via Optimal Transport: Enabling Data Selection from Partially Revealed Sources" (…☆14Oct 26, 2023Updated 2 years ago
- A library for language transfer methods and algorithms.☆16Feb 6, 2026Updated 4 months ago
- Data Shapley: Equitable Valuation of Data for Machine Learning☆296May 1, 2024Updated 2 years ago
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.☆123Mar 24, 2026Updated 2 months ago
- Lightweight library for accessing data and configuration☆13Apr 16, 2025Updated last year
- Scalable data valuation using optimal transport (ICLR 2025)☆13Jul 15, 2025Updated 11 months ago
- ☆51Mar 24, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Papers about training data quality management for ML models.☆120Updated this week
- A Python Data Valuation Package☆35Feb 3, 2023Updated 3 years ago
- Replication data and code for "Prestige drives epistemic inequality in the diffusion of scientific ideas"☆14Dec 14, 2018Updated 7 years ago
- This is a PyTorch reimplementation of Influence Functions from the ICML2017 best paper: Understanding Black-box Predictions via Influence…☆347Oct 29, 2023Updated 2 years ago
- Code repository for the AISTATS 2021 paper "Towards Understanding the Optimal Behaviors of Deep Active Learning Algorithms"☆14Mar 20, 2021Updated 5 years ago
- ☆11May 11, 2022Updated 4 years ago
- Repo - Paper "Capturing Semantics for Imputation with Pre-trained Language Models." [ICDE 2021]☆10Mar 13, 2022Updated 4 years ago
- The official implementation of USENIX Security'23 paper "Meta-Sift" -- Ten minutes or less to find a 1000-size or larger clean subset on …☆20Apr 27, 2023Updated 3 years ago
- Efficient approximation algorithms for Shapley Values in Horizontal Enterprise Federated Learning☆13May 5, 2020Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Training-free data valuation on deep neural network applications. (ICML-2022)☆27Jul 13, 2022Updated 3 years ago
- Implementation of Influence Function approximations for differently sized ML models, using PyTorch☆18Sep 15, 2023Updated 2 years ago
- ☆32May 24, 2023Updated 3 years ago
- Egocentric Temporal Motifs Miner☆13Nov 9, 2021Updated 4 years ago
- AI Logging for Interpretability and Explainability🔬☆145Jun 7, 2024Updated 2 years ago
- Data Valuation without Training of a Model, submitted to ICLR'23☆22Dec 30, 2022Updated 3 years ago
- Pairwise Difference Learning (PDL) is a meta-learning framework that leverages pairwise differences to transform multiclass problems into…☆22Aug 27, 2025Updated 9 months ago
- Reading list for "The Shapley Value in Machine Learning" (JCAI 2022)☆155Aug 8, 2022Updated 3 years ago
- Official code repository for our accepted work "Gradient Driven Rewards to Guarantee Fairness in Collaborative Machine Learning" in NeurI…☆28Sep 28, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Mixture of Decision Trees for Interpretable Machine Learning☆11Sep 2, 2021Updated 4 years ago
- ☆13Feb 25, 2022Updated 4 years ago
- Analysis of evidential models☆15Jun 22, 2023Updated 2 years ago
- A Data-Centric library providing a unified interface for state-of-the-art methods for hardness characterisation of data points.☆26Mar 6, 2025Updated last year
- ☆18Mar 1, 2022Updated 4 years ago
- Achieve error-rate fairness between societal groups for any score-based classifier.☆19Aug 21, 2025Updated 9 months ago
- For distributed machine learning☆32Mar 2, 2026Updated 3 months ago