A curated reading list of research in Sparse Autoencoders, Feature Extraction and related topics in Mechanistic Interpretability
☆30Jan 30, 2025Updated last year
Alternatives and similar repositories for awesome-sparse-autoencoders
Users that are interested in awesome-sparse-autoencoders are comparing it to the libraries listed below
Sorting:
- ☆19Mar 5, 2024Updated 2 years ago
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)☆20Jan 19, 2025Updated last year
- Sparse Autoencoder Training Library☆55May 1, 2025Updated 10 months ago
- Improving Steering Vectors by Targeting Sparse Autoencoder Features☆27Nov 20, 2024Updated last year
- Code repo for the model organisms and convergent directions of EM papers.☆53Sep 22, 2025Updated 5 months ago
- A library for efficient patching and automatic circuit discovery.☆90Dec 31, 2025Updated 2 months ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆247Feb 27, 2026Updated last week
- SimX-OR: Extending Any Simulation Benchmark to Evaluate the Observational Robustness of VLA Models☆31Nov 4, 2025Updated 4 months ago
- Residual Quantization Autoencoder, used for interpreting LLMs☆14Jan 1, 2025Updated last year
- ☆24Feb 18, 2026Updated 2 weeks ago
- Variational Auto-Encoder implementation in Tensorflow☆10Jan 22, 2017Updated 9 years ago
- Official repository for "DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Opinion Text Generation"☆10May 20, 2022Updated 3 years ago
- The musical corpus of flow (124 transcriptions of popular rap songs)☆12Apr 25, 2024Updated last year
- ☆12Jul 8, 2024Updated last year
- Compilation of ML/AI Resources for Members of MITxHarvard Women in AI☆11Mar 28, 2022Updated 3 years ago
- the source code of IJCAI 2023 paper "Multi-Scale subgraph contrastive learning"☆10Apr 25, 2023Updated 2 years ago
- Machine Learning Reading Group☆11Sep 15, 2023Updated 2 years ago
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.☆13May 11, 2022Updated 3 years ago
- Prompt-Guided Retrieval For Non-Knowledge-Intensive Tasks☆12Sep 1, 2023Updated 2 years ago
- ☆15Aug 19, 2024Updated last year
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces☆12Apr 19, 2023Updated 2 years ago
- Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024)☆10Aug 26, 2024Updated last year
- Code for ChordSync, a conformer-based audio-to-chord synchroniser