dynamical-inference / patchsae
Implementation of PatchSAE as presented in "Sparse autoencoders reveal selective remapping of visual concepts during adaptation"
☆12Updated last week
Alternatives and similar repositories for patchsae:
Users that are interested in patchsae are comparing it to the libraries listed below
- Sparse autoencoders for vision☆28Updated last week
- Official pytorch implementation of "Interpreting the Second-Order Effects of Neurons in CLIP"☆39Updated 5 months ago
- What do we learn from inverting CLIP models?☆54Updated last year
- Erasing conceptual knowledge from language models through low-rank fine-tuning☆17Updated last month
- Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)☆72Updated last year
- ☆53Updated 6 months ago
- ☆43Updated 5 months ago
- Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations" (ICLR '25)☆67Updated 2 months ago
- ☆38Updated 8 months ago
- Model Merging with SVD to Tie the KnOTS [ICLR 2025]☆52Updated last month
- Code for the paper - ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning☆18Updated 8 months ago
- A curated list of Awesome Personalized Large Multimodal Models resources☆20Updated last month
- Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models☆76Updated 7 months ago
- Official code implementation for the paper "Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Expl…☆11Updated last month
- Data distillation benchmark☆58Updated this week
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent☆80Updated 2 months ago
- Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)☆28Updated last year
- [TMLR 2025] On Memorization in Diffusion Models☆24Updated last year
- [CVPR 2024 Highlight] OpenBias: Open-set Bias Detection in Text-to-Image Generative Models☆23Updated 2 months ago
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆35Updated 6 months ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆101Updated last year
- Holistic evaluation of multimodal foundation models☆47Updated 8 months ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆22Updated this week
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆50Updated 11 months ago
- Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024)☆11Updated 8 months ago
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆54Updated 4 months ago
- Rare-to-Frequent (R2F), ICLR'25, Spotlight☆41Updated last week
- ☆14Updated last year
- Unlearning in Diffusion Models using Sparse Autoencoders☆20Updated last month
- Official code for "Can We Talk Models Into Seeing the World Differently?" (ICLR 2025).☆23Updated 3 months ago