Sparse autoencoders for vision
☆57Mar 17, 2026Updated last week
Alternatives and similar repositories for saev
Users that are interested in saev are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Symmetric Encryption with Language Models☆13Jun 13, 2023Updated 2 years ago
- Code for reproducing our paper "Are Sparse Autoencoders Useful? A Case Study in Sparse Probing"☆32Mar 31, 2025Updated 11 months ago
- Code for the CIKM'23 paper "A Retrieve-and-Read Framework for Knowledge Graph Link Prediction"☆12Mar 23, 2025Updated last year
- A dataset of 1M insect specimens with DNA barcodes, taxonomy, and images.☆33Apr 30, 2025Updated 10 months ago
- [NeurIPS 24] A new training and evaluation framework for learning interpretable deep vision models and benchmarking different interpretab…☆30Jun 5, 2025Updated 9 months ago
- Implementation of PatchSAE as presented in "Sparse autoencoders reveal selective remapping of visual concepts during adaptation"☆30Oct 31, 2025Updated 4 months ago
- [ICLR '25] Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations"☆101Nov 30, 2025Updated 3 months ago
- Repository for the Paper: Refusing Safe Prompts for Multi-modal Large Language Models☆18Oct 16, 2024Updated last year
- ☆25Apr 23, 2024Updated last year
- ☆13Oct 7, 2024Updated last year
- ☆27Feb 9, 2023Updated 3 years ago
- ViT Prisma is a mechanistic interpretability library for Vision and Video Transformers (ViTs).☆348Jul 23, 2025Updated 8 months ago
- ☆23Oct 25, 2024Updated last year
- Official Implementation of understanding the latent space of diffusion models through the lens of riemannian geometry (NeurIPS 2023)☆93Feb 20, 2024Updated 2 years ago
- [ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc…☆16Feb 22, 2025Updated last year
- ☆39Aug 26, 2025Updated 6 months ago
- This is an official implementation for Finer-CAM: Spotting the Difference Reveals Finer Details for Visual Explanation. [CVPR'25] Better …☆48Nov 10, 2025Updated 4 months ago
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks☆10Nov 27, 2024Updated last year
- Jupyter Notebooks from book UNDERSTANDING DEEP LEARNING (Prof Simon Prince) that I could solve.☆12Mar 20, 2024Updated 2 years ago
- ICIAP2022 - Learning Semantics for Visual Place Recognition through Multi-Scale Attention☆16May 10, 2022Updated 3 years ago
- A pytorch implementation of the AAAI2021 paper GraCapsNet: Interpretable Graph Capsule Networks for Object Recognition☆10Oct 2, 2022Updated 3 years ago
- Zero Allocation WASM☆58Feb 18, 2026Updated last month
- [NeurIPS 2024 Spotlight] code for "Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement"☆19Jan 26, 2025Updated last year
- Constrained learning using boxes for event-event relation extraction☆12Aug 5, 2022Updated 3 years ago
- virtual node analysis on ogb benchmark dataset☆14Mar 9, 2023Updated 3 years ago
- [NeurIPS '25 Spotlight] Official Pytorch implementation of "Vision Transformers Don't Need Trained Registers"☆173Sep 19, 2025Updated 6 months ago
- An Illusion of Progress? Assessing the Current State of Web Agents☆158Jan 2, 2026Updated 2 months ago
- [ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.☆191Sep 26, 2025Updated 5 months ago
- Official implementation of "Diffusion-Driven Two-Stage Active Learning for Low-Budget Semantic Segmentation" (NeurIPS 2025)☆20Dec 2, 2025Updated 3 months ago
- This framework implements key experiments on the sparse double descent phenomenon (ICML 2022).☆15Dec 13, 2022Updated 3 years ago
- Weakly Supervised Object Localization via Class RE-Activation Mapping☆12Sep 19, 2022Updated 3 years ago
- VideoGPA is a self-supervised framework that enhances 3D consistency in Video Diffusion Models.☆42Mar 16, 2026Updated last week
- An Autonomous Curriculum Reinforcement Learning framework that steers agents to continually learn in specific environments with zero huma…☆25Feb 25, 2026Updated 3 weeks ago
- ☆16Sep 6, 2024Updated last year
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆25Feb 10, 2026Updated last month
- Official implementation of "Interpreting and Controlling Vision Foundation Models via Text Explanations"☆14May 29, 2024Updated last year
- YesBut - Multimodal Satire Comprehension Dataset☆18Oct 23, 2024Updated last year
- Official Implementation of "Fine-Tuning is Fine, if Calibrated.", NeurIPS 2024☆21Apr 25, 2025Updated 10 months ago
- GeckoNum Benchmark for T2I Model Eval.☆15Dec 5, 2024Updated last year