Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value (ICML 2023)
☆21Jul 26, 2023Updated 2 years ago
Alternatives and similar repositories for dataoob
Users that are interested in dataoob are comparing it to the libraries listed below
Sorting:
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).☆52Jun 5, 2024Updated last year
- A Python Data Valuation Package☆33Feb 3, 2023Updated 3 years ago
- [CVPR 2021] Scalability vs. Utility: Do We Have to Sacrifice One for the Other in Data Importance Quantification?☆33Dec 26, 2020Updated 5 years ago
- DP-HyperparamTuning offers an array of tools for fast and easy hypertuning of various hyperparameters for the DP-SGD algorithm.☆23Sep 27, 2021Updated 4 years ago
- Scalable data valuation using optimal transport (ICLR 2025)☆13Jul 15, 2025Updated 7 months ago
- Official Repo for "Efficient task-specific data valuation for nearest neighbor algorithms"☆26Feb 21, 2020Updated 6 years ago
- OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)☆100Feb 4, 2025Updated last year
- Measuring data importance over ML pipelines using the Shapley value.☆45Aug 26, 2025Updated 6 months ago
- Official code for Fisher information embedding for node and graph learning (ICML 2023)☆19Jun 17, 2023Updated 2 years ago
- ☆15Nov 12, 2021Updated 4 years ago
- A simple Jax implementation of influence functions.☆20Apr 9, 2024Updated last year
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral)☆43Nov 10, 2022Updated 3 years ago
- This is a PyTorch reimplementation of Influence Functions from the ICML2017 best paper: Understanding Black-box Predictions via Influence…☆17Jun 20, 2020Updated 5 years ago
- pyDVL is a library of stable implementations of algorithms for data valuation and influence function computation☆142Feb 11, 2026Updated 2 weeks ago
- [ICLR 2022] "Deep Ensembling with No Overhead for either Training or Testing: The All-Round Blessings of Dynamic Sparsity" by Shiwei Liu,…☆27Jun 15, 2022Updated 3 years ago
- Data Shapley: Equitable Valuation of Data for Machine Learning☆288May 1, 2024Updated last year
- ☆26Mar 17, 2023Updated 2 years ago
- [CVPR 2022] Implementation of "Shapley-NAS: Discovering Operation Contribution for Neural Architecture Search"☆23Jan 9, 2023Updated 3 years ago
- PyTorch implementation of POEM (Out-of-distribution detection with posterior sampling), ICML 2022☆28May 6, 2023Updated 2 years ago
- Official Implementation of Unweighted Data Subsampling via Influence Function - AAAI 2020☆64Apr 14, 2021Updated 4 years ago
- Distributional Shapley: A Distributional Framework for Data Valuation☆30May 1, 2024Updated last year
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆79Oct 3, 2024Updated last year
- A Data-Driven Approach to Predict the Success of Bank Telemarketing☆10Apr 27, 2021Updated 4 years ago
- Using FlexAttention to compute attention with different masking patterns☆47Sep 22, 2024Updated last year
- ☆46Dec 20, 2020Updated 5 years ago
- Team FDVTS_DR's solutions for MICCAI2022 Diabetic Retinopathy Analysis Challenge (DRAC)☆14Mar 5, 2024Updated last year
- ☆10Jul 16, 2023Updated 2 years ago
- ☆12Jan 5, 2026Updated last month
- pFedDef: Defending Grey-Box Attacks for Personalized Federated Learning☆10May 31, 2023Updated 2 years ago
- Framework for geospatial research CNN development.☆11Apr 10, 2021Updated 4 years ago
- Guide to interviewing for industry machine learning roles (data/applied/research scientist, ML engineer, etc).☆11Dec 28, 2022Updated 3 years ago
- Two-stream remote sensing model for water quality mapping: 2SeaColor☆10Feb 2, 2021Updated 5 years ago
- CoCoFL: Communication- and Computation-Aware Federated Learning via Partial NN Freezing and Quantization☆13Aug 3, 2024Updated last year
- A collection of demos and utilities prepared ahead of the Vector Institute Privacy Enhancing Techniques (PETs) Bootcamp.☆15Sep 22, 2022Updated 3 years ago
- Automatically notifies you of start and completion using environment variables☆13Aug 4, 2023Updated 2 years ago
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 2 years ago
- Multimodal SuperCon: Classifier for Drivers of Deforestation in Indonesia☆10Nov 18, 2023Updated 2 years ago
- Crawl & Visualize NeurIPS 2022 Data from OpenReview☆14Nov 8, 2022Updated 3 years ago
- Data for "Datamodels: Predicting Predictions with Training Data"☆97May 25, 2023Updated 2 years ago