k-hanawa / criteria_for_instance_based_explanation

☆9

Alternatives and similar repositories for criteria_for_instance_based_explanation

Users who are interested in criteria_for_instance_based_explanation are comparing it to the repositories listed below.

ZaydH / influence_analysis_papers
Influence Analysis and Estimation - Survey, Papers, and Taxonomy
☆79 · Updated last year
kawine / dataset_difficulty
"Understanding Dataset Difficulty with V-Usable Information" (ICML 2022, outstanding paper)
☆87 · Updated last year
ecreager / eiil
Code for Environment Inference for Invariant Learning (ICML 2021 Paper)
☆50 · Updated 4 years ago
dongxinshuai / RIFT-NeurIPS2021
☆11 · Updated 3 years ago
r-three / mats
☆31 · Updated last year
launchnlp / LitCab
☆25 · Updated last month
successar / instance_attributions_NLP
☆17 · Updated 4 years ago
ykwon0407 / DataInf
DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)
☆71 · Updated 9 months ago
mmatena / model_merging
☆70 · Updated 3 years ago
salesforce / fast-influence-functions
☆89 · Updated 2 months ago
ajyl / dpo_toxic
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.
☆74 · Updated 4 months ago
zepingyu0512 / in-context-mechanism
code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…
☆12 · Updated 8 months ago
shauli-ravfogel / nullspace_projection
☆89 · Updated 3 years ago
princetonvisualai / directional-bias-amp
https://arxiv.org/abs/2102.12594
☆14 · Updated last year
MadryLab / trak
A fast, effective data attribution method for neural networks in PyTorch
☆212 · Updated 7 months ago
alstonlo / torch-influence
A simple PyTorch implementation of influence functions.
☆89 · Updated last year
anniesch / jtt
Code for "Just Train Twice: Improving Group Robustness without Training Group Information"
☆72 · Updated last year
adamxyang / laplace-lora
Bayesian low-rank adaptation for large language models
☆23 · Updated last year
UKPLab / emnlp2020-debiasing-unknown
☆26 · Updated 4 years ago
dannyallover / overthinking_the_truth
☆29 · Updated last year
ssagawa / overparam_spur_corr
An Investigation of Why Overparameterization Exacerbates Spurious Correlations
☆31 · Updated 5 years ago
sylinrl / CalibratedMath
Teaching Models to Express Their Uncertainty in Words
☆39 · Updated 3 years ago
zleizzo / datadeletion
☆14 · Updated 5 years ago
tatsu-lab / conformal-factual-lm
☆32 · Updated last year
RobertCsordas / modules
The official repository for our paper "Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks". We…
☆46 · Updated last year
AI-secure / InfoBERT
[ICLR 2021] "InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective" by Boxin Wang, Shuohang Wang, Y…
☆85 · Updated last year
UCSB-NLP-Chang / llm_uncertainty
☆31 · Updated last year
ryokamoi / pytorch_influence_functions
This is a PyTorch reimplementation of Influence Functions from the ICML2017 best paper: Understanding Black-box Predictions via Influence…
☆16 · Updated 5 years ago
google-research / heldout-influence-estimation
☆62 · Updated 4 years ago
harshays / inputgradients
Do input gradients highlight discriminative features? [NeurIPS 2021] (https://arxiv.org/abs/2102.12781)
☆13 · Updated 2 years ago