OSU-NLP-Group/saev

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OSU-NLP-Group/saev)

OSU-NLP-Group / saev

Sparse autoencoders for vision

☆61

Alternatives and similar repositories for saev

Users that are interested in saev are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Imageomics / bioclip-2
View on GitHub
Repository for the BioCLIP 2 model project. [NeurIPS'25 Spotlight] BioCLIP 2 is a biological foundation model trained on TreeOfLife-200M.…
☆67Updated this week
zer0int / CLIP-SAE-finetune
View on GitHub
Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.
☆18Dec 19, 2024Updated last year
OSU-NLP-Group / KG-R3
View on GitHub
Code for the CIKM'23 paper "A Retrieve-and-Read Framework for Knowledge Graph Link Prediction"
☆13Mar 23, 2025Updated last year
bscho333 / ReVisiT
View on GitHub
[ACL 2026 Main] Revisit What You See: Revealing Visual Semantics in Vision Tokens to Guide LVLM Decoding
☆25Nov 21, 2025Updated 6 months ago
dynamical-inference / patchsae
View on GitHub
Implementation of PatchSAE as presented in "Sparse autoencoders reveal selective remapping of visual concepts during adaptation"
☆30Apr 22, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Imageomics / INTR
View on GitHub
This is an official implementation for [ICLR'24] INTR: Interpretable Transformer for Fine-grained Image Classification.
☆59Apr 15, 2024Updated 2 years ago
mrwu-mac / R-Bench
View on GitHub
[ICML2024] Repo for the paper `Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models'
☆24Jan 1, 2025Updated last year
nickjiang2378 / vlm-hallucinations
View on GitHub
[ICLR '25] Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations"
☆103Nov 30, 2025Updated 5 months ago
Sadcardation / ImageProtector
View on GitHub
Repository for the Paper: Leave My Images Alone: Preventing Multi-Modal Large Language Models from Analyzing Images via Visual Prompt Inj…
☆19Apr 17, 2026Updated last month
aaronserianni / attention-iou
View on GitHub
[CVPR'25] Attention IoU: Examining Biases in CelebA using Attention Maps
☆13Mar 26, 2025Updated last year
HugoFry / mats_sae_training_for_ViTs
View on GitHub
☆25Apr 23, 2024Updated 2 years ago
ahadjawaid / openego
View on GitHub
☆31Mar 8, 2026Updated 2 months ago
itsvaibhav01 / Immune
View on GitHub
[CVPR2025] Official Repository for IMMUNE: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
☆28Jun 11, 2025Updated 11 months ago
ExplainableML / sae-for-vlm
View on GitHub
[NeurIPS 2025] Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models
☆84Nov 27, 2025Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
OSU-NLP-Group / AgentAttack
View on GitHub
☆22Oct 25, 2024Updated last year
Prisma-Multimodal / ViT-Prisma
View on GitHub
ViT Prisma is a mechanistic interpretability library for Vision and Video Transformers (ViTs).
☆363Jul 23, 2025Updated 10 months ago
SALT-NLP / PersuationGames
View on GitHub
[ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc…
☆16Feb 22, 2025Updated last year
XXXVincent / MonoDepth2
View on GitHub
Mono depth on nuscenes dataset
☆21Feb 25, 2021Updated 5 years ago
BriansIDP / video-SALMONN-o1
View on GitHub
☆39Aug 26, 2025Updated 8 months ago
saprmarks / dictionary_learning
View on GitHub
☆416Aug 21, 2025Updated 9 months ago
Imageomics / Finer-CAM
View on GitHub
This is an official implementation for Finer-CAM: Spotting the Difference Reveals Finer Details for Visual Explanation. [CVPR'25] Better …
☆49Updated this week
language-agent-tutorial / language-agent-tutorial.github.io
View on GitHub
[EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks
☆10Nov 27, 2024Updated last year
JindongGu / GraCapsNet
View on GitHub
A pytorch implementation of the AAAI2021 paper GraCapsNet: Interpretable Graph Capsule Networks for Object Recognition
☆10Oct 2, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
eujhwang / vn-analysis
View on GitHub
virtual node analysis on ogb benchmark dataset
☆14Mar 9, 2023Updated 3 years ago
EvolvingLMMs-Lab / multimodal-sae
View on GitHub
[ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.
☆198Sep 26, 2025Updated 7 months ago
LaVi-Lab / Rethink_CoT_Video
View on GitHub
Official code for "Rethinking Chain-of-Thought Reasoning for Videos"
☆20Dec 14, 2025Updated 5 months ago
showlab / FocusUI
View on GitHub
[CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection
☆32Feb 10, 2026Updated 3 months ago
tonychenxyz / vit-interpret
View on GitHub
Official implementation of "Interpreting and Controlling Vision Foundation Models via Text Explanations"
☆14May 29, 2024Updated last year
OSU-NLP-Group / Mind2Web-2
View on GitHub
[NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge
☆111May 17, 2026Updated last week
abhi1nandy2 / yesbut_dataset
View on GitHub
YesBut - Multimodal Satire Comprehension Dataset
☆19Oct 23, 2024Updated last year
Sungwon-Han / DualFair
View on GitHub
☆14Jan 28, 2023Updated 3 years ago
Infini-AI-Lab / M2PO
View on GitHub
☆30Oct 8, 2025Updated 7 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
OSU-NLP-Group / Explorer
View on GitHub
[ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents
☆28Feb 17, 2026Updated 3 months ago
jeniyat / Candidacy-Template-OSU-CSE
View on GitHub
☆10Dec 18, 2020Updated 5 years ago
OSU-MLB / Fine-Tuning-Is-Fine-If-Calibrated
View on GitHub
Official Implementation of "Fine-Tuning is Fine, if Calibrated.", NeurIPS 2024
☆21Apr 25, 2025Updated last year
ppuliu / GloRE
View on GitHub
Global Relation Embedding for Relation Extraction
☆66Sep 4, 2018Updated 7 years ago
google-deepmind / geckonum_benchmark_t2i
View on GitHub
GeckoNum Benchmark for T2I Model Eval.
☆15Dec 5, 2024Updated last year
KihoPark / linear_rep_geometry
View on GitHub
☆121Feb 11, 2025Updated last year
TIGER-AI-Lab / VideoEval-Pro
View on GitHub
More reliable Video Understanding Evaluation
☆15Sep 23, 2025Updated 8 months ago