Trustworthy-ML-Lab / Linear-ExplanationsLinks

[ICML 24] A novel automated neuron explanation framework that can accurately describe poly-semantic concepts in deep neural networks

☆13

Alternatives and similar repositories for Linear-Explanations

Users that are interested in Linear-Explanations are comparing it to the libraries listed below

Sorting:

Trustworthy-ML-Lab / Describe-and-Dissect
[TMLR 25] An automated method for explaining complex neuron behaviors in deep vision models using large language models
☆10Updated 5 months ago
adaminsky / compositional_concepts
Code for the CCE algorithm proposed in "Towards Compositionality in Concept Learning" at ICML 2024.
☆16Updated last year
hamidkazemi22 / CLIPInversion
What do we learn from inverting CLIP models?
☆55Updated last year
sail-sg / D-TRAK
Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)
☆31Updated last year
tmlabonte / last-layer-retraining
Official codebase for the NeurIPS 2023 paper: Towards Last-layer Retraining for Group Robustness with Fewer Annotations. https://arxiv.or…
☆11Updated last year
abonte / protopdebug
Implementation of Concept-level Debugging of Part-Prototype Networks
☆11Updated 2 years ago
stanford-crfm / air-bench-2024
AIR-Bench 2024 is a safety benchmark that aligns with emerging government regulations and company policies
☆23Updated 11 months ago
peterljq / Parsimonious-Concept-Engineering
PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)
☆39Updated 9 months ago
yossigandelsman / second_order_lens
Official pytorch implementation of "Interpreting the Second-Order Effects of Neurons in CLIP"
☆39Updated 8 months ago
ExplainableML / sae-for-vlm
Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models
☆24Updated 3 months ago
YanNeu / spurious_imagenet
Spurious Features Everywhere - Large-Scale Detection of Harmful Spurious Features in ImageNet
☆32Updated last year
ZhentingWang / DUMP
☆22Updated 2 months ago
alinlab / b2t
Bias-to-Text: Debiasing Unknown Visual Biases through Language Interpretation
☆31Updated 2 years ago
ElvishElvis / LCA-on-the-line
LCA-on-the-line (ICML 2024 Oral)
☆12Updated 5 months ago
BatsResearch / ex2
If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions
☆17Updated last year
rgeirhos / dataset-pruning-metrics
Metrics for "Beyond neural scaling laws: beating power law scaling via data pruning " (NeurIPS 2022 Outstanding Paper Award)
☆56Updated 2 years ago
gortizji / tangent_task_arithmetic
Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".
☆103Updated 2 years ago
VITA-Group / Robust_Weight_Signatures
[ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang
☆16Updated 2 years ago
Model-GLUE / Model-GLUE
☆15Updated 11 months ago
k1rezaei / Text-to-concept
☆34Updated last year
piotr-teterwak / erm_plusplus
☆17Updated last year
facebookresearch / Whac-A-Mole
Code for the paper "A Whac-A-Mole Dilemma Shortcuts Come in Multiples Where Mitigating One Amplifies Others"
☆49Updated last year
YuYang0901 / CLIP-spurious-finetune
Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning (ICML 2023)
☆18Updated last year
MadryLab / dataset-interfaces
Dataset Interfaces: Diagnosing Model Failures Using Controllable Counterfactual Generation
☆45Updated 2 years ago
tsb0601 / MultiMon
☆25Updated 2 years ago
locuslab / T-MARS
Code for T-MARS data filtering
☆35Updated last year
Wuyxin / DISC
(ICML 2023) Discover and Cure: Concept-aware Mitigation of Spurious Correlation
☆41Updated last year
tanganke / opcm
☆14Updated 6 months ago
OPTML-Group / DP4TL
[NeurIPS2023] "Selectivity Drives Productivity: Efficient Dataset Pruning for Enhanced Transfer Learning" by Yihua Zhang*, Yimeng Zhang*,…
☆13Updated last year
Heidelberg-NLP / CC-SHAP-VLM
Official code implementation for the paper "Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Expl…
☆12Updated 4 months ago