koayon/awesome-sparse-autoencoders

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/koayon/awesome-sparse-autoencoders)

koayon / awesome-sparse-autoencoders

A curated reading list of research in Sparse Autoencoders, Feature Extraction and related topics in Mechanistic Interpretability

☆33

Alternatives and similar repositories for awesome-sparse-autoencoders

Users that are interested in awesome-sparse-autoencoders are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

koayon / atp_star
View on GitHub
PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)
☆20Jan 19, 2025Updated last year
saprmarks / dictionary_learning
View on GitHub
☆428Aug 21, 2025Updated 11 months ago
slavachalnev / SAE-TS
View on GitHub
Improving Steering Vectors by Targeting Sparse Autoencoder Features
☆29Nov 20, 2024Updated last year
jiahai-feng / binding-iclr
View on GitHub
☆19Mar 5, 2024Updated 2 years ago
ag8 / sha-transformer
View on GitHub
☆12Jul 8, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lyh6560new / P3Sum
View on GitHub
The offical code for paper "What Constitutes a Faithful Summary? Preserving Author Perspectives in News Summarization"
☆10Jun 23, 2024Updated 2 years ago
ajobi-uhc / seer
View on GitHub
This was designed for interp researchers who want to do research on or with interp agents to give quality of life improvements and fix …
☆146Feb 8, 2026Updated 5 months ago
DanielSc4 / Dynamic-Activation-Composition
View on GitHub
Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"
☆14Nov 22, 2024Updated last year
Harvard-CS-2881 / harvard-cs-2881-hw0
View on GitHub
harvard-cs-2881-classroom-hw0-c2881-hw0 created by GitHub Classroom
☆16Jul 26, 2025Updated last year
YeeZ93 / Awesome-Object-Centric-Learning
View on GitHub
A curated list of researches in object-centric learning
☆11Oct 14, 2024Updated last year
zjunlp / WorldMind
View on GitHub
Aligning Agentic World Models via Knowledgeable Experience Learning
☆37May 15, 2026Updated 2 months ago
JindongJiang / SlotSSMs
View on GitHub
Official Release of NeurIPS 2024 paper "Slot State Space Models"
☆11Mar 22, 2025Updated last year
Lexsi-Labs / xai_evals
View on GitHub
Evaluation Matrices for Explainability Methods
☆15Nov 5, 2025Updated 8 months ago
FlyingPumba / InterpBench
View on GitHub
A benchmark for mechanistic discovery of circuits in Transformers
☆17Dec 15, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ApolloResearch / e2e_sae
View on GitHub
Sparse Autoencoder Training Library
☆58May 1, 2025Updated last year
Jometeorie / MultiHopShortcuts
View on GitHub
Reproduction Code for Paper "Investigating Multi-Hop Factual Shortcuts in Knowledge Editing of Large Language Models"
☆14Jun 1, 2024Updated 2 years ago
h-yu16 / DomainBed-v2
View on GitHub
☆13Oct 10, 2024Updated last year
JoshEngels / MultiDimensionalFeatures
View on GitHub
Code for reproducing our paper "Not All Language Model Features Are Linear"
☆90Nov 27, 2024Updated last year
dsb-ifi / SPiT
View on GitHub
A Spitting Image: Modular Superpixel Tokenization in Vision Transformers
☆23Sep 12, 2025Updated 10 months ago
clarifying-EM / model-organisms-for-EM
View on GitHub
Code repo for the model organisms and convergent directions of EM papers.
☆72Sep 22, 2025Updated 10 months ago
QosmoInc / NeuralBeatbox_ML_Examples
View on GitHub
☆14Sep 13, 2022Updated 3 years ago
facebookresearch / scalable-curvature
View on GitHub
Code for Dayal Kalra's research internship on scalable curvature measures for neural networks.
☆29Feb 3, 2026Updated 5 months ago
andreamust / ChordSync
View on GitHub
Code for ChordSync, a conformer-based audio-to-chord synchroniser
☆14Oct 17, 2025Updated 9 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
callummcdougall / sae_vis
View on GitHub
Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).
☆268Feb 27, 2026Updated 5 months ago
trestad / Factual-Recall-Mechanism
View on GitHub
The code for paper Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models.
☆13Apr 10, 2024Updated 2 years ago
ThomasYerxa / mmcr
View on GitHub
☆21Sep 16, 2024Updated last year
mingukjang / TAST
View on GitHub
Test-time adaptation via Nearest neighbor information (TAST), submitted to ICLR'23
☆24Jul 11, 2023Updated 3 years ago
zjunlp / knowledge-rumination
View on GitHub
[EMNLP 2023] Knowledge Rumination for Pre-trained Language Models
☆17Jun 29, 2023Updated 3 years ago
christofw / multipitch_architectures
View on GitHub
Pytorch project accompanying the paper "Comparing Deep Models and Evaluation Strategies for Multi-Pitch Estimation in Music Recordings", …
☆15Aug 26, 2022Updated 3 years ago
MTG / music-explore
View on GitHub
App to explore latent spaces of music collections
☆38Dec 1, 2025Updated 7 months ago
danielgomezmarin / rhythmtoolbox
View on GitHub
Python code used to analyze and process symbolic drum patterns
☆14May 8, 2023Updated 3 years ago
Dakingrai / awesome-mechanistic-interpretability-lm-papers
View on GitHub
☆260Nov 22, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
jephianlin / FIS-la-solutions
View on GitHub
Solutions to the textbook Linear Algebra by Friedberg, Insel, and Spence
☆17Jul 17, 2023Updated 3 years ago
esennesh / dcpc_paper
View on GitHub
☆12Aug 26, 2025Updated 11 months ago
julianmichael / debate
View on GitHub
Debate interface, experiments, etc.
☆11Mar 12, 2024Updated 2 years ago
IDEA-XL / SubgDiff
View on GitHub
The official implementation of NeurIPS2024 paper "SubgDiff: A Subgraph Diffusion Model to Improve Molecular Representation Learning."
☆11May 28, 2025Updated last year
Phylliida / MambaLens
View on GitHub
Mamba support for transformer lens
☆20Sep 17, 2024Updated last year
jnika / ACE_Analyzer
View on GitHub
Qualitative evaluation of automatic chord extraction results: analysis of the musical relationships between predicted chords and target c…
☆10Oct 25, 2021Updated 4 years ago
Xiao-Ming / VAEChordEstimation
View on GitHub
Implementation of the experiments for "Semi-supervised Neural Chord Estimation Based on a Variational Autoencoder with Latent Chord Label…
☆11Dec 3, 2020Updated 5 years ago