dynamical-inference / patchsae
Implementation of PatchSAE as presented in "Sparse autoencoders reveal selective remapping of visual concepts during adaptation"
☆14 · Updated last month
Alternatives and similar repositories for patchsae
Users who are interested in patchsae are comparing it to the libraries listed below.
- Code for the paper - ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning ☆19 · Updated 9 months ago
- ☆40 · Updated 9 months ago
- Sparse autoencoders for vision ☆31 · Updated this week
- What do we learn from inverting CLIP models? ☆54 · Updated last year
- Erasing conceptual knowledge from language models through low-rank fine-tuning ☆18 · Updated 2 months ago
- [CVPR 2024 Highlight] OpenBias: Open-set Bias Detection in Text-to-Image Generative Models ☆23 · Updated 3 months ago
- Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024) ☆73 · Updated last year
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models". ☆102 · Updated last year
- Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations" (ICLR '25) ☆73 · Updated last week
- Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024) ☆30 · Updated last year
- ☆62 · Updated 8 months ago
- ☆19 · Updated 2 months ago
- [TMLR 2025] On Memorization in Diffusion Models ☆26 · Updated last year
- Official pytorch implementation of "Interpreting the Second-Order Effects of Neurons in CLIP" ☆39 · Updated 6 months ago
- Model Merging with SVD to Tie the KnOTS [ICLR 2025] ☆56 · Updated 2 months ago
- [ACL2025] Unsolvable Problem Detection: Robust Understanding Evaluation for Large Multimodal Models ☆77 · Updated last week
- [ICLR 2025] SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image and Video Generation ☆38 · Updated 4 months ago
- [ICML 2025] Unlearning in Diffusion Models using Sparse Autoencoders ☆24 · Updated 3 weeks ago
- ☆53 · Updated 7 months ago
- [CVPR 2024] Self-Discovering Interpretable Diffusion Latent Directions for Responsible Text-to-Image Generation ☆40 · Updated last year
- ☆17 · Updated 5 months ago
- Sparse Linear Concept Embeddings ☆98 · Updated 2 months ago
- ☆27 · Updated last month
- [ICLR 23 spotlight] An automatic and efficient tool to describe functionalities of individual neurons in DNNs ☆50 · Updated last year
- Official code implementation for the paper "Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Expl… ☆11 · Updated 2 months ago
- Rare-to-Frequent (R2F), ICLR'25, Spotlight ☆46 · Updated last month
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24] ☆56 · Updated 5 months ago
- [NeurIPS 2024 D&B Track] UnlearnCanvas: A Stylized Image Dataset to Benchmark Machine Unlearning for Diffusion Models by Yihua Zhang, Cho… ☆67 · Updated 6 months ago
- LCA-on-the-line (ICML 2024 Oral) ☆11 · Updated 3 months ago
- Code of the paper: Finetuning Text-to-Image Diffusion Models for Fairness