OpenMOSS / Language-Model-SAEs

For OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research.
87Updated this week

Alternatives and similar repositories for Language-Model-SAEs:

Users that are interested in Language-Model-SAEs are comparing it to the libraries listed below