koayon / awesome-sparse-autoencodersView external linksLinks
A curated reading list of research in Sparse Autoencoders, Feature Extraction and related topics in Mechanistic Interpretability
☆30Jan 30, 2025Updated last year
Alternatives and similar repositories for awesome-sparse-autoencoders
Users that are interested in awesome-sparse-autoencoders are comparing it to the libraries listed below
Sorting:
- ☆389Aug 21, 2025Updated 5 months ago
- ☆16Mar 5, 2024Updated last year
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)☆20Jan 19, 2025Updated last year
- Sparse Autoencoder Training Library☆56May 1, 2025Updated 9 months ago
- Code repo for the model organisms and convergent directions of EM papers.☆49Sep 22, 2025Updated 4 months ago
- Improving Steering Vectors by Targeting Sparse Autoencoder Features☆27Nov 20, 2024Updated last year
- This was designed for interp researchers who want to do research on or with interp agents to give quality of life improvements and fix …☆124Updated this week
- Attribution-based Parameter Decomposition☆33Jun 11, 2025Updated 8 months ago
- ☆12Jul 8, 2024Updated last year
- Variational Auto-Encoder implementation in Tensorflow☆10Jan 22, 2017Updated 9 years ago
- ☆12Aug 15, 2023Updated 2 years ago
- Residual Quantization Autoencoder, used for interpreting LLMs☆14Jan 1, 2025Updated last year
- Official repository for "DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Opinion Text Generation"☆10May 20, 2022Updated 3 years ago
- The official repository for "Piano score rearrangement into multiple difficulty levels via notation-to-notation approach" incl. ST+ token…☆12Feb 26, 2024Updated last year
- Pytorch implementation of HCNAF: Hyper-Conditioned Neural Autoregressive Flow (CVPR 2020)☆15Jun 14, 2020Updated 5 years ago
- Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024)☆10Aug 26, 2024Updated last year
- incremental symbol learning for natural language understanding☆10Jun 12, 2023Updated 2 years ago
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces☆12Apr 19, 2023Updated 2 years ago
- ☆10Jun 3, 2019Updated 6 years ago
- BachDuet enables a human performer to improvise a duet counterpoint with a computer agent in real time.☆14Aug 8, 2022Updated 3 years ago
- ☆16Aug 19, 2024Updated last year
- Reproduction Code for Paper "Investigating Multi-Hop Factual Shortcuts in Knowledge Editing of Large Language Models"☆13Jun 1, 2024Updated last year
- Federated reconnaissance mini-ImageNet benchmark and baseline models☆13Sep 2, 2021Updated 4 years ago
- Machine Learning Reading Group☆11Sep 15, 2023Updated 2 years ago
- a fast and customizable CUDA int4 tensor core gemm☆15Aug 2, 2024Updated last year
- Compilation of ML/AI Resources for Members of MITxHarvard Women in AI☆11Mar 28, 2022Updated 3 years ago
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.☆13May 11, 2022Updated 3 years ago
- An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA☆14Jun 1, 2022Updated 3 years ago
- Python code used to analyze and process symbolic drum patterns☆14May 8, 2023Updated 2 years ago
- Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF…☆31Oct 26, 2025Updated 3 months ago
- The official implementation of NeurIPS2024 paper "SubgDiff: A Subgraph Diffusion Model to Improve Molecular Representation Learning."☆10May 28, 2025Updated 8 months ago
- ☆13Apr 10, 2025Updated 10 months ago
- Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions☆10May 16, 2018Updated 7 years ago
- The offical code for paper "What Constitutes a Faithful Summary? Preserving Author Perspectives in News Summarization"☆10Jun 23, 2024Updated last year
- [EMNLP 2025] Circuit-Aware Editing Enables Generalizable Knowledge Learners☆18Nov 17, 2025Updated 2 months ago
- This repository is the summary of all of our works for the XLA.☆11Jan 14, 2018Updated 8 years ago
- ☆14Sep 13, 2022Updated 3 years ago
- Bias Correction of Learned Generative Models using Likelihood-Free Importance Weighting☆12Mar 24, 2023Updated 2 years ago
- Tools for studying developmental interpretability in neural networks.☆126Dec 30, 2025Updated last month