A curated reading list of research in Sparse Autoencoders, Feature Extraction and related topics in Mechanistic Interpretability
☆32Jan 30, 2025Updated last year
Alternatives and similar repositories for awesome-sparse-autoencoders
Users that are interested in awesome-sparse-autoencoders are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆422Aug 21, 2025Updated 9 months ago
- ☆19Mar 5, 2024Updated 2 years ago
- This was designed for interp researchers who want to do research on or with interp agents to give quality of life improvements and fix …☆147Feb 8, 2026Updated 4 months ago
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols☆13Aug 13, 2024Updated last year
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Variational Auto-Encoder implementation in Tensorflow☆10Jan 22, 2017Updated 9 years ago
- A collection of lightweight interpretability scripts to understand how LLMs think☆89Mar 18, 2026Updated 3 months ago
- Attribution-based Parameter Decomposition☆34Jun 11, 2025Updated last year
- Reproduction Code for Paper "Investigating Multi-Hop Factual Shortcuts in Knowledge Editing of Large Language Models"☆14Jun 1, 2024Updated 2 years ago
- Sparse Autoencoder Training Library☆57May 1, 2025Updated last year
- ☆28Oct 30, 2025Updated 7 months ago
- ☆22Sep 16, 2025Updated 9 months ago
- Pytorch project accompanying the paper "Comparing Deep Models and Evaluation Strategies for Multi-Pitch Estimation in Music Recordings", …☆13Aug 26, 2022Updated 3 years ago
- ☆26Jun 29, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆48Jan 3, 2026Updated 5 months ago
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆87Nov 27, 2024Updated last year
- [EMNLP 2025] Circuit-Aware Editing Enables Generalizable Knowledge Learners☆20Nov 17, 2025Updated 7 months ago
- ☆32Sep 19, 2025Updated 9 months ago
- SimX-OR: Extending Any Simulation Benchmark to Evaluate the Observational Robustness of VLA Models☆33Nov 4, 2025Updated 7 months ago
- Code to enable layer-level steering in LLMs using sparse auto encoders☆33Sep 18, 2025Updated 9 months ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆261Feb 27, 2026Updated 3 months ago
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models☆18Jun 29, 2023Updated 2 years ago
- The code for paper Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models.☆13Apr 10, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆19May 19, 2025Updated last year
- ☆253Nov 22, 2024Updated last year
- Redwood Research's transformer interpretability tools☆15Apr 15, 2022Updated 4 years ago
- Code for experiments on transformers using Markovian data.☆22Nov 22, 2024Updated last year
- ☆13Apr 10, 2025Updated last year
- Code for "Automatic Circuit Finding and Faithfulness"☆17Jul 11, 2024Updated last year
- Code release for the paper "Style Vectors for Steering Generative Large Language Models", accepted to the Findings of the EACL 2024.☆36Sep 26, 2024Updated last year
- ☆12Aug 26, 2025Updated 9 months ago
- Exploring Model Kinship for Merging Large Language Models☆29Apr 16, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- The official implementation of NeurIPS2024 paper "SubgDiff: A Subgraph Diffusion Model to Improve Molecular Representation Learning."☆11May 28, 2025Updated last year
- The musical corpus of flow (124 transcriptions of popular rap songs)☆15Apr 25, 2024Updated 2 years ago
- Mamba support for transformer lens☆20Sep 17, 2024Updated last year
- The official repository for "Piano score rearrangement into multiple difficulty levels via notation-to-notation approach" incl. ST+ token…☆13Feb 26, 2024Updated 2 years ago
- Implementation of the experiments for "Semi-supervised Neural Chord Estimation Based on a Variational Autoencoder with Latent Chord Label…☆11Dec 3, 2020Updated 5 years ago
- [CVPR 2026 Fingdings] This repo is the official implementation of "Euclid’s Gift: Enhancing Spatial Perception and Reasoning in Vision‑La…☆28Mar 15, 2026Updated 3 months ago
- ☆11Nov 30, 2024Updated last year