dynamical-inference / patchsaeLinks
Implementation of PatchSAE as presented in "Sparse autoencoders reveal selective remapping of visual concepts during adaptation"
☆28Updated 3 months ago
Alternatives and similar repositories for patchsae
Users that are interested in patchsae are comparing it to the libraries listed below
Sorting:
- Sparse autoencoders for vision☆55Updated 3 weeks ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆108Updated 2 years ago
- What do we learn from inverting CLIP models?☆58Updated last year
- Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)☆37Updated 2 years ago
- ☆24Updated last year
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"☆19Updated last year
- [ICLR '25] Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations"☆96Updated 2 months ago
- ☆79Updated last year
- [ICLR 25] A novel framework for building intrinsically interpretable LLMs with human-understandable concepts to ensure safety, reliabilit…☆29Updated 5 months ago
- Localization of Knowledge in Text-to-Image Models☆12Updated last year
- Code for the paper: Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept Discovery. ECCV 2024.☆55Updated last year
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆42Updated last week
- [ICLR 2025] This repository contains the code to reproduce the results from our paper From Sparse Dependence to Sparse Attention: Unveili…☆12Updated 10 months ago
- FuseLIP: Multimodal Embeddings via Early Fusion of Discrete Tokens☆17Updated 4 months ago
- [ICML 2024] Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models☆152Updated 7 months ago
- Official code implementation for the paper "Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Expl…☆12Updated 9 months ago
- ☆13Updated 9 months ago
- 👋 Overcomplete is a Vision-based SAE Toolbox☆117Updated last month
- ☆16Updated 9 months ago
- ☆27Updated 2 months ago
- SpuCo is a Python package developed to further research to address spurious correlations.☆25Updated last year
- [NeurIPS 2025] Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models☆60Updated 2 months ago
- Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)☆77Updated last year
- [ICLR 23 spotlight] An automatic and efficient tool to describe functionalities of individual neurons in DNNs☆59Updated 2 years ago
- Repository for PURE: Turning Polysemantic Neurons Into Pure Features by Identifying Relevant Circuits, accepted at CVPR 2024 XAI4CV Works…☆19Updated last year
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆51Updated last month
- Official code for "Can We Talk Models Into Seeing the World Differently?" (ICLR 2025).☆27Updated last year
- A curated list of Awesome Personalized Large Multimodal Models resources☆52Updated 2 weeks ago
- [ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.☆174Updated 4 months ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆79Updated last year