k-hanawa / criteria_for_instance_based_explanation
☆9Updated last year
Alternatives and similar repositories for criteria_for_instance_based_explanation:
Users that are interested in criteria_for_instance_based_explanation are comparing it to the libraries listed below
- ☆28Updated 7 months ago
- Code for Environment Inference for Invariant Learning (ICML 2021 Paper)☆50Updated 3 years ago
- ☆26Updated last year
- ☆14Updated 4 years ago
- https://arxiv.org/abs/2102.12594☆14Updated last year
- Group-conditional DRO to alleviate spurious correlations☆15Updated 3 years ago
- ☆11Updated 2 years ago
- Code for Debiasing Vision-Language Models via Biased Prompts☆56Updated last year
- tianlu-wang / Identifying-and-Mitigating-Spurious-Correlations-for-Improving-Robustness-in-NLP-ModelsNAACL 2022 Findings☆15Updated 2 years ago
- ☆87Updated last year
- This is a PyTorch reimplementation of Influence Functions from the ICML2017 best paper: Understanding Black-box Predictions via Influence…☆16Updated 4 years ago
- A simple PyTorch implementation of influence functions.☆84Updated 8 months ago
- ☆60Updated 3 years ago
- ☆11Updated 2 years ago
- Code for "Just Train Twice: Improving Group Robustness without Training Group Information"☆70Updated 9 months ago
- ☆26Updated 3 years ago
- [ACL 2020] Towards Debiasing Sentence Representations☆64Updated 2 years ago
- Code for NeurIPS'23 paper "A Bayesian Approach To Analysing Training Data Attribution In Deep Learning"☆15Updated last year
- Post-processing for fair classification☆13Updated last month
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy☆69Updated 11 months ago
- ☆64Updated 2 years ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆62Updated 3 months ago