pyDVL is a library of stable implementations of algorithms for data valuation and influence function computation
☆144Feb 11, 2026Updated last month
Alternatives and similar repositories for pyDVL
Users that are interested in pyDVL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)☆100Feb 4, 2025Updated last year
- [CVPR 2021] Scalability vs. Utility: Do We Have to Sacrifice One for the Other in Data Importance Quantification?☆34Dec 26, 2020Updated 5 years ago
- Learning function operators with neural networks.☆35Aug 22, 2024Updated last year
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral)☆43Nov 10, 2022Updated 3 years ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆81Oct 3, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).☆53Jun 5, 2024Updated last year
- The Python library for sensible AI.☆73Mar 2, 2026Updated 3 weeks ago
- A library for language transfer methods and algorithms.☆16Feb 6, 2026Updated last month
- Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature☆187Jun 24, 2025Updated 9 months ago
- A fire-tested template for production grade python libraries and packages.☆18Jul 15, 2025Updated 8 months ago
- Data Shapley: Equitable Valuation of Data for Machine Learning☆291May 1, 2024Updated last year
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.☆120Updated this week
- PyTorch reimplementation of computing Shapley values via Truncated Monte Carlo sampling from "What is your data worth? Equitable Valuatio…☆27Jan 21, 2022Updated 4 years ago
- ☆65Jan 13, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Lightweight library for accessing data and configuration☆13Apr 16, 2025Updated 11 months ago
- Scalable data valuation using optimal transport (ICLR 2025)☆13Jul 15, 2025Updated 8 months ago
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy☆87Feb 27, 2024Updated 2 years ago
- A simple PyTorch implementation of influence functions.☆92Jun 17, 2024Updated last year
- Official Repo for "Efficient task-specific data valuation for nearest neighbor algorithms"☆27Feb 21, 2020Updated 6 years ago
- ☆51Mar 24, 2023Updated 3 years ago
- Papers about training data quality management for ML models.☆112Mar 20, 2026Updated last week
- A Python Data Valuation Package☆34Feb 3, 2023Updated 3 years ago
- Measuring data importance over ML pipelines using the Shapley value.☆45Aug 26, 2025Updated 7 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Reinforcement learning (RL) is an effective method to find reasoning pathways in incomplete knowledge graphs (KGs). To overcome the chall…☆26Oct 13, 2024Updated last year
- Replication data and code for "Prestige drives epistemic inequality in the diffusion of scientific ideas"☆14Dec 14, 2018Updated 7 years ago
- This is a PyTorch reimplementation of Influence Functions from the ICML2017 best paper: Understanding Black-box Predictions via Influence…☆344Oct 29, 2023Updated 2 years ago
- Code repository for the AISTATS 2021 paper "Towards Understanding the Optimal Behaviors of Deep Active Learning Algorithms"☆14Mar 20, 2021Updated 5 years ago
- Repo - Paper "Capturing Semantics for Imputation with Pre-trained Language Models." [ICDE 2021]☆10Mar 13, 2022Updated 4 years ago
- The official implementation of USENIX Security'23 paper "Meta-Sift" -- Ten minutes or less to find a 1000-size or larger clean subset on …☆20Apr 27, 2023Updated 2 years ago
- Efficient approximation algorithms for Shapley Values in Horizontal Enterprise Federated Learning☆13May 5, 2020Updated 5 years ago
- Implementation of Influence Function approximations for differently sized ML models, using PyTorch☆16Sep 15, 2023Updated 2 years ago
- A python library to discover and mitigate biases in machine learning models and datasets☆20Jul 6, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Data for "Datamodels: Predicting Predictions with Training Data"☆96May 25, 2023Updated 2 years ago
- Egocentric Temporal Motifs Miner☆12Nov 9, 2021Updated 4 years ago
- ☆13Feb 24, 2020Updated 6 years ago
- Official Implementation of Unweighted Data Subsampling via Influence Function - AAAI 2020☆64Apr 14, 2021Updated 4 years ago
- Reading list for "The Shapley Value in Machine Learning" (JCAI 2022)☆154Aug 8, 2022Updated 3 years ago
- Pairwise Difference Learning (PDL) is a meta-learning framework that leverages pairwise differences to transform multiclass problems into…☆22Aug 27, 2025Updated 7 months ago
- ☆13Feb 25, 2022Updated 4 years ago