Sparse probing paper full code.
☆67 · Dec 17, 2023 · Updated 2 years ago
Alternatives and similar repositories for sparse-probing-paper
Users who are interested in sparse-probing-paper are comparing it to the repositories listed below.
- Find context neurons in Pythia models. ☆13 · Jun 13, 2023 · Updated 2 years ago
- Universal Neurons in GPT2 Language Models ☆30 · May 28, 2024 · Updated last year
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models" ☆13 · Jul 18, 2024 · Updated last year
- Evaluate interpretability methods on localizing and disentangling concepts in LLMs. ☆57 · Oct 30, 2025 · Updated 4 months ago
- ☆100 · Aug 8, 2024 · Updated last year
- Sparse Autoencoder for Mechanistic Interpretability ☆292 · Jul 20, 2024 · Updated last year
- Mechanistic Interpretability Visualizations using React ☆328 · Dec 18, 2024 · Updated last year
- Training Sparse Autoencoders on Language Models ☆1,233 · Feb 27, 2026 · Updated last week
- ☆20 · Feb 17, 2023 · Updated 3 years ago
- ☆36 · Apr 30, 2024 · Updated last year