k-hanawa / criteria_for_instance_based_explanation
☆9 Updated 2 years ago
Alternatives and similar repositories for criteria_for_instance_based_explanation
Users who are interested in criteria_for_instance_based_explanation are comparing it to the repositories listed below.
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy ☆79 Updated last year
- "Understanding Dataset Difficulty with V-Usable Information" (ICML 2022, outstanding paper) ☆87 Updated last year
- Code for Environment Inference for Invariant Learning (ICML 2021 Paper) ☆50 Updated 4 years ago
- ☆11 Updated 3 years ago
- ☆31 Updated last year
- ☆25 Updated last month
- ☆17 Updated 4 years ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024) ☆71 Updated 9 months ago
- ☆70 Updated 3 years ago
- ☆89 Updated 2 months ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity. ☆74 Updated 4 months ago
- Code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M… ☆12 Updated 8 months ago
- ☆89 Updated 3 years ago
- https://arxiv.org/abs/2102.12594 ☆14 Updated last year
- A fast, effective data attribution method for neural networks in PyTorch ☆212 Updated 7 months ago
- A simple PyTorch implementation of influence functions. ☆89 Updated last year
- Code for "Just Train Twice: Improving Group Robustness without Training Group Information" ☆72 Updated last year
- Bayesian low-rank adaptation for large language models ☆23 Updated last year
- ☆26 Updated 4 years ago
- ☆29 Updated last year
- An Investigation of Why Overparameterization Exacerbates Spurious Correlations ☆31 Updated 5 years ago
- Teaching Models to Express Their Uncertainty in Words ☆39 Updated 3 years ago
- ☆14 Updated 5 years ago
- ☆32 Updated last year
- The official repository for our paper "Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks". We… ☆46 Updated last year
- [ICLR 2021] "InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective" by Boxin Wang, Shuohang Wang, Y… ☆85 Updated last year
- ☆31 Updated last year
- This is a PyTorch reimplementation of Influence Functions from the ICML2017 best paper: Understanding Black-box Predictions via Influence… ☆16 Updated 5 years ago
- ☆62 Updated 4 years ago
- Do input gradients highlight discriminative features? [NeurIPS 2021] (https://arxiv.org/abs/2102.12781) ☆13 Updated 2 years ago