dynamical-inference / patchsaeLinks
Implementation of PatchSAE as presented in "Sparse autoencoders reveal selective remapping of visual concepts during adaptation"
☆18Updated 2 months ago
Alternatives and similar repositories for patchsae
Users that are interested in patchsae are comparing it to the libraries listed below
Sorting:
- What do we learn from inverting CLIP models?☆55Updated last year
- ☆18Updated 6 months ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆102Updated 2 years ago
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"☆14Updated 7 months ago
- Sparse autoencoders for vision☆37Updated 3 weeks ago
- ☆19Updated 3 months ago
- ☆44Updated 10 months ago
- Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)☆31Updated last year
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆29Updated 8 months ago
- Erasing conceptual knowledge from language models through low-rank fine-tuning☆19Updated 3 months ago
- ☆57Updated 8 months ago
- This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.☆28Updated last year
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆38Updated 8 months ago
- Official code implementation for the paper "Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Expl…☆12Updated 3 months ago
- Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations" (ICLR '25)☆75Updated last month
- ☆22Updated 3 weeks ago
- Official code for "Can We Talk Models Into Seeing the World Differently?" (ICLR 2025).☆26Updated 5 months ago
- Holistic evaluation of multimodal foundation models☆48Updated 11 months ago
- ☆16Updated 2 months ago
- ☆51Updated 7 months ago
- Confidence Regulation Neurons in Language Models (NeurIPS 2024)☆10Updated 5 months ago
- Official pytorch implementation of "Interpreting the Second-Order Effects of Neurons in CLIP"☆39Updated 8 months ago
- Official Repository of Personalized Visual Instruct Tuning☆31Updated 4 months ago
- Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)☆76Updated last year
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆57Updated 7 months ago
- [ICLR 23 spotlight] An automatic and efficient tool to describe functionalities of individual neurons in DNNs☆54Updated last year
- [ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.☆145Updated last week
- Model Merging with SVD to Tie the KnOTS [ICLR 2025]☆59Updated 3 months ago
- Official Repo for FoodieQA paper (EMNLP 2024)☆16Updated 3 weeks ago
- The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.☆10Updated 7 months ago