shehper / sparse-dictionary-learning

An Open Source Implementation of Anthropic's Paper: "Towards Monosemanticity: Decomposing Language Models with Dictionary Learning"
29Updated 6 months ago

Related projects

Alternatives and complementary repositories for sparse-dictionary-learning