This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).
☆53Jun 5, 2024Updated last year
Alternatives and similar repositories for LAVA
Users that are interested in LAVA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Scalable data valuation using optimal transport (ICLR 2025)☆13Jul 15, 2025Updated 8 months ago
- OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)☆100Feb 4, 2025Updated last year
- A Python Data Valuation Package☆34Feb 3, 2023Updated 3 years ago
- Optimal Transport Dataset Distance☆177May 23, 2022Updated 3 years ago
- 💱 A curated list of data valuation (DV) to design your next data marketplace☆138Feb 20, 2025Updated last year
- Phishing detection using GNNs (SECRYPT'22)☆15Jun 6, 2025Updated 9 months ago
- pyDVL is a library of stable implementations of algorithms for data valuation and influence function computation☆144Feb 11, 2026Updated last month
- ☆17Mar 23, 2025Updated last year
- The official implementation of USENIX Security'23 paper "Meta-Sift" -- Ten minutes or less to find a 1000-size or larger clean subset on …☆20Apr 27, 2023Updated 2 years ago
- Efficient approximation algorithms for Shapley Values in Horizontal Enterprise Federated Learning☆13May 5, 2020Updated 5 years ago
- ☆10Oct 31, 2022Updated 3 years ago
- The Linear Optimal Transport Framework☆16Oct 7, 2020Updated 5 years ago
- Implementation of FedBary☆16Mar 24, 2025Updated 11 months ago
- Blog post☆17Feb 16, 2024Updated 2 years ago
- Papers about training data quality management for ML models.☆112Oct 15, 2025Updated 5 months ago
- Data Valuation without Training of a Model, submitted to ICLR'23☆22Dec 30, 2022Updated 3 years ago
- [NeurIPS 2022] "Randomized Channel Shuffling: Minimal-Overhead Backdoor Attack Detection without Clean Datasets" by Ruisi Cai*, Zhenyu Zh…☆21Oct 1, 2022Updated 3 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- ☆14Feb 26, 2025Updated last year
- ☆22Dec 15, 2023Updated 2 years ago
- Official code repository for our accepted work "Gradient Driven Rewards to Guarantee Fairness in Collaborative Machine Learning" in NeurI…☆26Sep 28, 2024Updated last year
- ☆17May 28, 2024Updated last year
- Implementation of Gradient Information Optimization (GIO) for effective and scalable training data selection☆14Jun 22, 2023Updated 2 years ago
- [Preprint] Backdoor Attacks on Federated Learning with Lottery Ticket Hypothesis☆10Sep 23, 2021Updated 4 years ago
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.☆118Mar 13, 2026Updated last week
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆17Oct 23, 2022Updated 3 years ago
- This is a PyTorch reimplementation of Influence Functions from the ICML2017 best paper: Understanding Black-box Predictions via Influence…☆344Oct 29, 2023Updated 2 years ago
- Data Valuation on In-Context Examples (ACL23)☆24Jan 12, 2025Updated last year
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆81Oct 3, 2024Updated last year
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- Data Benchmarking☆23May 24, 2024Updated last year
- Official Implementation of NIPS 2022 paper Pre-activation Distributions Expose Backdoor Neurons☆15Jan 13, 2023Updated 3 years ago
- Notebooks for managing NeurIPS 2014 and analysing the NeurIPS experiment.☆13May 22, 2024Updated last year
- ☆29Jul 26, 2021Updated 4 years ago
- ☆39Feb 25, 2024Updated 2 years ago
- Measuring data importance over ML pipelines using the Shapley value.☆45Aug 26, 2025Updated 6 months ago
- [ICLR 2023, Spotlight] Indiscriminate Poisoning Attacks on Unsupervised Contrastive Learning☆31Dec 2, 2023Updated 2 years ago
- ☆21Dec 5, 2021Updated 4 years ago
- Combining SOAP and MUON☆19Feb 11, 2025Updated last year