shehper/sparse-dictionary-learning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shehper/sparse-dictionary-learning)

shehper / sparse-dictionary-learning

An Open Source Implementation of Anthropic's Paper: "Towards Monosemanticity: Decomposing Language Models with Dictionary Learning"

☆67

Alternatives and similar repositories for sparse-dictionary-learning

Users that are interested in sparse-dictionary-learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

neelnanda-io / 1L-Sparse-Autoencoder
View on GitHub
☆141Oct 28, 2023Updated 2 years ago
wesg52 / llm-context-neurons
View on GitHub
Find context neurons in Pythia models.
☆13Jun 13, 2023Updated 3 years ago
kslav / cdr_mri
View on GitHub
This is the code corresponding to our publication introducing ConvDecoder with physics-based regularization (CD+r) for MRI
☆10Feb 6, 2026Updated 5 months ago
jnward / monosemanticity-repro
View on GitHub
Open source repro of "Towards Monosemanticity"
☆33May 6, 2024Updated 2 years ago
thomasahle / cce
View on GitHub
Clustered Compositional Embeddings
☆13Oct 25, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
wtong98 / mlp-icl
View on GitHub
☆12Sep 16, 2024Updated last year
HoagyC / sparse_coding
View on GitHub
Using sparse coding to find distributed representations used by neural networks.
☆306Nov 10, 2023Updated 2 years ago
callummcdougall / sae-exercises-mats
View on GitHub
☆26Dec 20, 2023Updated 2 years ago
tim-lawson / mlsae
View on GitHub
Multi-Layer Sparse Autoencoders (ICLR 2025)
☆30Feb 6, 2026Updated 5 months ago
ai-safety-foundation / sparse_autoencoder
View on GitHub
Sparse Autoencoder for Mechanistic Interpretability
☆303Jul 20, 2024Updated 2 years ago
openai / sparse_autoencoder
View on GitHub
☆595Jul 19, 2024Updated 2 years ago
EleutherAI / sparsify
View on GitHub
Sparsify transformers with SAEs and transcoders
☆732Updated this week
yding5 / AdaptiveBinning
View on GitHub
Adaptive-binning for evaluation of confidence calibration
☆12Jul 28, 2019Updated 6 years ago
Obarads / Point_Cloud_Tutorial
View on GitHub
This repository contains tutorial code and supplementary note for point cloud processing.
☆12Jun 21, 2026Updated last month
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
kherud / neural-additive-models-pt
View on GitHub
PyTorch implementation for Neural Additive Models
☆25Dec 2, 2020Updated 5 years ago
METR / Measuring-Early-2025-AI-on-Exp-OSS-Devs
View on GitHub
Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity: https://metr.org/blog/2025-07-10-early-2025-ai-e…
☆16Feb 23, 2026Updated 4 months ago
alexjfoote / Neuron2Graph
View on GitHub
Tools for exploring Transformer neuron behaviour, including input pruning and diversification.
☆10Jun 6, 2023Updated 3 years ago
vsatyakumar / automatic-local-outlier-factor-tuning
View on GitHub
Python implementation of the local outlier factor tuning algorithm described in “Automatic Hyperparameter Tuning Method for Local Outlier…
☆10Aug 3, 2020Updated 5 years ago
openai / automated-interpretability
View on GitHub
☆1,082Mar 6, 2024Updated 2 years ago
TalnUPF / ConceptExtraction
View on GitHub
☆11Aug 15, 2023Updated 2 years ago
cooperleong00 / Awesome-LLM-Interpretability
View on GitHub
A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..
☆309Jan 22, 2026Updated 5 months ago
koayon / atp_star
View on GitHub
PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)
☆20Jan 19, 2025Updated last year
ejnnr / cupbearer
View on GitHub
A library for mechanistic anomaly detection
☆22Jan 9, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
shikaiqiu / compute-better-spent
View on GitHub
☆63Oct 3, 2024Updated last year
MikaStars39 / FeatureAlignment
View on GitHub
FeatureAlignment = Alignment + Mechanistic Interpretability
☆35Mar 8, 2025Updated last year
jkallini / mission-impossible-language-models
View on GitHub
Code repository for the paper "Mission: Impossible Language Models."
☆56Sep 25, 2025Updated 9 months ago
astonzhang / Parameterization-of-Hypercomplex-Multiplications
View on GitHub
Implementation for the PHM paper at ICLR'21
☆13Mar 1, 2023Updated 3 years ago
sustcsonglin / flash-linear-rnn
View on GitHub
Implementations of various linear RNN layers using pytorch and triton
☆55Aug 4, 2023Updated 2 years ago
uu-sml / calibration
View on GitHub
Python package for evaluating model calibration in classification
☆20Nov 12, 2019Updated 6 years ago
yizhongw / truthfulqa_reeval
View on GitHub
☆12Mar 7, 2024Updated 2 years ago
Akirato / PERM-GaussianKG
View on GitHub
PERM GaussianKG
☆10Nov 24, 2021Updated 4 years ago
xbresson / Long_Tailed_Learning_Requires_Feature_Learning
View on GitHub
Repository for ICLR'23 Long-tailed Learning Requires Feature Learning
☆10Feb 22, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
inria-thoth / csa
View on GitHub
Official Pytorch implementation of Chromatic Graph Transformers
☆10Jun 14, 2023Updated 3 years ago
saprmarks / geometry-of-truth
View on GitHub
☆113Aug 8, 2024Updated last year
GuoTianYu2000 / Active-Dormant-Attention
View on GitHub
codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"
☆11Dec 30, 2024Updated last year
dahyun-kang / cub-200-2011-part-visualizer
View on GitHub
Visualization tool for CUB-200-2011 part keypoints (Wah et al.).
☆10Sep 17, 2021Updated 4 years ago
jontromanab / sq_grasp
View on GitHub
superquadrics based grasping
☆13Dec 4, 2018Updated 7 years ago
ledmaster / unified-embeddings
View on GitHub
Implementation of Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems
☆15Nov 11, 2023Updated 2 years ago
shehper / AC-Solver
View on GitHub
A long-horizon, sparse-reward math environment for reinforcement learning. Official code repo for "What makes Math problems hard for rein…
☆36Aug 11, 2025Updated 11 months ago