HugoFry/mats_sae_training_for_ViTs

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HugoFry/mats_sae_training_for_ViTs)

HugoFry / mats_sae_training_for_ViTs

☆25

Alternatives and similar repositories for mats_sae_training_for_ViTs

Users that are interested in mats_sae_training_for_ViTs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

neuroexplicit-saar / Discover-then-Name
View on GitHub
Code for the paper: Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept Discovery. ECCV 2024.
☆59Nov 3, 2024Updated last year
vsahil / MIMETIC-2
View on GitHub
Official Code for MIMETIC^2
☆13Nov 19, 2024Updated last year
neelnanda-io / Neuroscope
View on GitHub
Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons
☆14Feb 13, 2023Updated 3 years ago
ArthurConmy / MishformerLens
View on GitHub
MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing…
☆10Oct 7, 2024Updated last year
neelnanda-io / Crosscoders
View on GitHub
☆60Nov 19, 2024Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
ypwang61 / negCLIPLoss_NormSim
View on GitHub
[NeurIPS 2024 Spotlight] CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning.
☆14Dec 12, 2024Updated last year
dgcnz / FACT
View on GitHub
Code for [Re] On the Reproducibility of Post-Hoc Concept Bottleneck Models.
☆13Nov 27, 2024Updated last year
vedantpalit / Towards-Vision-Language-Mechanistic-Interpretability
View on GitHub
This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce…
☆25Feb 16, 2026Updated 5 months ago
3cology / dinov2_with_attention_extraction
View on GitHub
PyTorch code and models for the DINOv2 self-supervised learning method.
☆12Nov 12, 2023Updated 2 years ago
ckkissane / crosscoder-model-diff-replication
View on GitHub
Open source replication of Anthropic's Crosscoders for Model Diffing
☆68Oct 27, 2024Updated last year
konpanousis / ConceptDiscoveryModels
View on GitHub
This is the official implementation of the Concept Discovery Models paper.
☆15Aug 27, 2023Updated 2 years ago
Butanium / tiny-activation-dashboard
View on GitHub
A tiny easily hackable implementation of a feature dashboard.
☆17Oct 21, 2025Updated 9 months ago
arnavmdas / epiphany
View on GitHub
☆13May 12, 2023Updated 3 years ago
michiganleon / ReCLIP_WACV
View on GitHub
☆18Mar 4, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
hijohnnylin / neuronpedia-scorer
View on GitHub
☆17Feb 14, 2024Updated 2 years ago
MLAI-Yonsei / CaRot
View on GitHub
source code for NeurIPS'24 paper "Towards Calibrated Robust Fine-Tuning of Vision-Language Models"
☆15Oct 31, 2025Updated 8 months ago
CausalTriplet / causaltriplet
View on GitHub
[CLeaR23] Causal Triplet: An Open Challenge for Intervention-centric Causal Representation Learning
☆31Apr 15, 2023Updated 3 years ago
mlfoundations / dataset2metadata
View on GitHub
☆28Mar 21, 2024Updated 2 years ago
tilde-research / sieve
View on GitHub
Applying SAEs for fine-grained control
☆27Dec 15, 2024Updated last year
kaiyuhwang / MLLM-Survey
View on GitHub
The paper list of multilingual pre-trained models (Continual Updated).
☆25Jun 18, 2024Updated 2 years ago
ApolloResearch / e2e_sae
View on GitHub
Sparse Autoencoder Training Library
☆58May 1, 2025Updated last year
decoderesearch / SAELens
View on GitHub
Training Sparse Autoencoders on Language Models
☆1,484Updated this week
XMUDeepLIT / SSR
View on GitHub
Code for "Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal" (ACL 2024)
☆17Oct 21, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
oclivegriffin / crosscode
View on GitHub
A library for training crosscoders
☆17May 28, 2025Updated last year
annahdo / implementing_activation_steering
View on GitHub
A collection of different ways to implement accessing and modifying internal model activations for LLMs
☆24Oct 18, 2024Updated last year
ethz-spylab / unlearning-vs-safety
View on GitHub
☆27Oct 6, 2024Updated last year
slavachalnev / SAE-TS
View on GitHub
Improving Steering Vectors by Targeting Sparse Autoencoder Features
☆29Nov 20, 2024Updated last year
saprmarks / dictionary_learning
View on GitHub
☆428Aug 21, 2025Updated 11 months ago
acmi-lab / RLSbench
View on GitHub
Code and results accompanying our paper titled RLSbench: Domain Adaptation under Relaxed Label Shift
☆35Jul 19, 2023Updated 3 years ago
HumanCompatibleAI / leela-interp
View on GitHub
Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"
☆31Jun 4, 2024Updated 2 years ago
steineggerlab / afdb-clusters-analysis
View on GitHub
Scripts to generate and analyze afdb clusters
☆11Sep 15, 2023Updated 2 years ago
dynamical-inference / cytosae
View on GitHub
Official implementation of CytoSAE: Interpretable Cell Embeddings for Hematology
☆27Jul 17, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
adamkarvonen / SAEBench
View on GitHub
☆178May 1, 2026Updated 2 months ago
string-os / string
View on GitHub
Markdown that runs — one file, any agent.
☆49Updated this week
irom-princeton / byovla
View on GitHub
Repo for Bring Your Own Vision-Language-Action (VLA) model, arxiv 2024
☆39Jan 22, 2025Updated last year
montemac / activation_additions
View on GitHub
Algebraic value editing in pretrained language models
☆71Nov 1, 2023Updated 2 years ago
Trustworthy-ML-Lab / Describe-and-Dissect
View on GitHub
[TMLR 25] An automated method for explaining complex neuron behaviors in deep vision models using large language models
☆11Feb 20, 2025Updated last year
Karbo123 / recon
View on GitHub
an universal pytorch deep learning experiment codebase
☆11Mar 31, 2025Updated last year
zer0int / CLIP-SAE-finetune
View on GitHub
Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.
☆18Dec 19, 2024Updated last year