An Open Source Implementation of Anthropic's Paper: "Towards Monosemanticity: Decomposing Language Models with Dictionary Learning"
☆57May 12, 2024Updated last year
Alternatives and similar repositories for sparse-dictionary-learning
Users that are interested in sparse-dictionary-learning are comparing it to the libraries listed below
Sorting:
- ☆134Oct 28, 2023Updated 2 years ago
- Find context neurons in Pythia models.☆13Jun 13, 2023Updated 2 years ago
- Open source repro of "Towards Monosemanticity"☆31May 6, 2024Updated last year
- Using sparse coding to find distributed representations used by neural networks.☆297Nov 10, 2023Updated 2 years ago
- ☆12Sep 16, 2024Updated last year
- A library for mechanistic anomaly detection☆22Jan 9, 2025Updated last year
- ☆32Feb 15, 2026Updated 2 weeks ago
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)☆20Jan 19, 2025Updated last year
- Training Sparse Autoencoders on Language Models☆1,219Updated this week
- ☆25Dec 20, 2023Updated 2 years ago
- Sparsify transformers with SAEs and transcoders☆696Feb 23, 2026Updated last week
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…☆30Oct 27, 2025Updated 4 months ago
- Serverless Optimized MODules - A Serverless Framework to create reusable micro apps☆18Jul 7, 2025Updated 7 months ago
- ☆571Jul 19, 2024Updated last year
- ☆29May 8, 2024Updated last year
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..☆294Jan 22, 2026Updated last month
- FeatureAlignment = Alignment + Mechanistic Interpretability☆34Mar 8, 2025Updated 11 months ago
- ☆1,072Mar 6, 2024Updated last year
- ☆15Feb 7, 2025Updated last year
- Collection of Reverse Engineering in Large Model☆36Jan 8, 2025Updated last year
- Performant framework for training, analyzing and visualizing Sparse Autoencoders (SAEs) and their frontier variants.☆194Feb 18, 2026Updated last week
- CONFSEC's ComputeNode component of the OpenPCC standard☆17Dec 15, 2025Updated 2 months ago
- ☆17Updated this week
- ☆12Aug 15, 2023Updated 2 years ago
- ☆16Jun 10, 2024Updated last year
- Landing page for Global Privacy Control (GPC)☆12Feb 1, 2026Updated last month
- Grab some/all of CodeQL CLI binary, QL library, VSCode starter workspace, VSCode and VSCode QL extension☆11Jun 12, 2025Updated 8 months ago
- A graphing calculator written in c.☆12Oct 17, 2023Updated 2 years ago
- ☆14Nov 25, 2024Updated last year
- A safe and easy way to run goroutines☆13Feb 3, 2023Updated 3 years ago
- Fast and simple C++ DSP engine with high-quality effects. Originally built for PhantomAmp, an Android app for rootless system-wide audio…☆17Aug 21, 2023Updated 2 years ago
- ☆30Oct 21, 2025Updated 4 months ago
- Gain information about applications to inform deployments☆11Mar 3, 2022Updated 3 years ago
- Unit-aware Computations for AI-driven Scientific Computing.☆17Jan 30, 2026Updated last month
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- ☆12Aug 2, 2022Updated 3 years ago
- 《챗GPT와 랭체인을 활용한 LLM 기반 AI 앱 개발》(2024년 6월 출간) 예제 코드☆11Mar 7, 2025Updated 11 months ago
- [TMLR 25] An automated method for explaining complex neuron behaviors in deep vision models using large language models☆10Feb 20, 2025Updated last year
- Exposes batch message receives (recvmmsg)☆14Aug 15, 2025Updated 6 months ago