A Python Data Valuation Package
☆35Feb 3, 2023Updated 3 years ago
Alternatives and similar repositories for valda
Users that are interested in valda are comparing it to the libraries listed below.
- Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value (ICML 2023)☆21Jul 26, 2023Updated 2 years ago
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).☆54Jun 5, 2024Updated 2 years ago
- Scalable data valuation using optimal transport (ICLR 2025)☆13Jul 15, 2025Updated 11 months ago
- This is an official repository for "Performance Scaling via Optimal Transport: Enabling Data Selection from Partially Revealed Sources" (…☆14Oct 26, 2023Updated 2 years ago
- 💱 A curated list of data valuation (DV) to design your next data marketplace☆143Feb 20, 2025Updated last year
- [CVPR 2021] Scalability vs. Utility: Do We Have to Sacrifice One for the Other in Data Importance Quantification?☆34Dec 26, 2020Updated 5 years ago
- The official implementation of "Unlocking the Potential of Unlabeled Data in Semi-Supervised Domain Generalization" (CVPR 2025)☆15Nov 20, 2025Updated 6 months ago
- Official implementation of "Diffusion-Driven Two-Stage Active Learning for Low-Budget Semantic Segmentation" (NeurIPS 2025)☆20Apr 2, 2026Updated 2 months ago
- OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)☆101Feb 4, 2025Updated last year
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral)☆43Nov 10, 2022Updated 3 years ago
- Data Banzhaf: A Robust Data Valuation Framework for Machine Learning (AISTATS 2023 Oral)☆18Oct 15, 2023Updated 2 years ago
- Using a MaskRCNN model trained on the PublayNet dataset with ML.Net in C# / .Net for Document layout analysis and page segmentation task…☆17May 13, 2023Updated 3 years ago
- Detect communities in legal networks☆12Dec 15, 2024Updated last year
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"☆10May 5, 2024Updated 2 years ago
- Official Repo for "Efficient task-specific data valuation for nearest neighbor algorithms"☆27Feb 21, 2020Updated 6 years ago
- [ICML 2026] Official Code for Rectified LpJEPA: Joint-Embedding Predictive Architectures with Sparse and Maximum-Entropy Representations☆76Feb 15, 2026Updated 4 months ago
- DP-Rewrite: Towards Reproducibility and Transparency in Differentially Private Text Rewriting☆15Apr 27, 2023Updated 3 years ago
- Code for experiments on self-prediction as a way to measure introspection in LLMs☆16Dec 10, 2024Updated last year
- pyDVL is a library of stable implementations of algorithms for data valuation and influence function computation☆146Apr 13, 2026Updated 2 months ago
- Grams: Gradient Descent with Adaptive Momentum Scaling (ICLR 2025 Workshop)☆17Mar 6, 2025Updated last year
- Phishing detection using GNNs (SECRYPT'22)☆15Jun 6, 2025Updated last year
- Official repository for the AAAI2026 paper (Zooming In on Fakes: A Novel Dataset for Localized AI-Generated Image Detection with Forgery …☆29Apr 24, 2026Updated last month
- Grokking on modular arithmetic in less than 150 epochs in MLX☆15Oct 24, 2024Updated last year
- ☆11Dec 8, 2021Updated 4 years ago
- [ICLR 2025] On Evaluating the Durability of Safeguards for Open-Weight LLMs☆13Jun 20, 2025Updated 11 months ago
- Robust Camera Trace Extraction (TIFS'23)☆12Oct 3, 2023Updated 2 years ago
- Data Shapley: Equitable Valuation of Data for Machine Learning☆296May 1, 2024Updated 2 years ago
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆18Sep 9, 2022Updated 3 years ago
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"☆14Mar 25, 2025Updated last year
- [ICLR 2025] FLAT: LLM Unlearning via Loss Adjustment with Only Forget Data☆14Feb 26, 2025Updated last year
- ☆16May 23, 2025Updated last year
- Official code repository for our accepted work "Gradient Driven Rewards to Guarantee Fairness in Collaborative Machine Learning" in NeurI…☆28Sep 28, 2024Updated last year
- Code for ICLR 2022 paper. "Promoting Saliency From Depth: Deep Unsupervised RGB-D Saliency Detection".☆18Mar 23, 2022Updated 4 years ago
- Under construction☆11Nov 20, 2022Updated 3 years ago
- A Python Wrapper To Retrieve Data From The CrowdTangle API☆11Mar 26, 2026Updated 2 months ago
- Web Photo Source Identification based on Neural Enhanced Camera Fingerprint (WWW2023)☆15Feb 25, 2023Updated 3 years ago
- Resources for face forgery detection☆10Jul 6, 2022Updated 3 years ago
- Code and Hummingbird dataset for EMNLP 2021 paper "Does BERT Learn as Humans Perceive? Understanding Linguistic Styles through Lexica"☆14Apr 13, 2022Updated 4 years ago
- Every work on Federated Learning Pruning☆23Feb 10, 2023Updated 3 years ago