Trustworthy-ML-Lab / Linear-Explanations
☆10Updated 2 months ago
Alternatives and similar repositories for Linear-Explanations:
Users that are interested in Linear-Explanations are comparing it to the libraries listed below
- Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)☆28Updated last year
- Official pytorch implementation of "Interpreting the Second-Order Effects of Neurons in CLIP"☆33Updated 3 months ago
- LCA-on-the-line (ICML 2024 Oral)☆11Updated last week
- Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations" (ICLR '25)☆59Updated 3 weeks ago
- Official code for "Can We Talk Models Into Seeing the World Differently?" (ICLR 2025).☆21Updated 3 weeks ago
- Repository for PURE: Turning Polysemantic Neurons Into Pure Features by Identifying Relevant Circuits, accepted at CVPR 2024 XAI4CV Works…☆12Updated 8 months ago
- ☆37Updated 3 months ago
- What do we learn from inverting CLIP models?☆49Updated 11 months ago
- An automatic and efficient tool to describe functionalities of individual neurons in DNNs☆43Updated last year
- Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts…☆56Updated last year
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)☆33Updated last year
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆15Updated 10 months ago
- ☆31Updated last year
- Bias-to-Text: Debiasing Unknown Visual Biases through Language Interpretation☆30Updated last year
- Spurious Features Everywhere - Large-Scale Detection of Harmful Spurious Features in ImageNet☆30Updated last year
- ☆14Updated this week
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".☆40Updated 11 months ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆39Updated 3 months ago
- Multi-Layer Sparse Autoencoders (ICLR 2025)☆18Updated last week
- Code for the paper "A Whac-A-Mole Dilemma Shortcuts Come in Multiples Where Mitigating One Amplifies Others"☆47Updated 7 months ago
- Official repo of Progressive Data Expansion: data, code and evaluation☆28Updated last year
- Implementation of PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆32Updated 3 months ago
- Code for T-MARS data filtering☆35Updated last year
- A simple and efficient baseline for data attribution☆11Updated last year
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent☆74Updated 6 months ago
- Implementation of Concept-level Debugging of Part-Prototype Networks☆11Updated last year
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆63Updated 8 months ago
- ☆12Updated 11 months ago
- ☆27Updated last year
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆95Updated last year