neelnanda-io / Neuroscope
Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons
☆12 Updated 2 years ago
Alternatives and similar repositories for Neuroscope:
Users who are interested in Neuroscope are comparing it to the repositories listed below.
- A Mechanistic Interpretability Analysis of Grokking ☆21 Updated 2 years ago
- ☆20 Updated 4 months ago
- Certified Reasoning with Language Models ☆31 Updated last year
- ☆17 Updated 6 months ago
- ☆12 Updated 3 weeks ago
- ☆16 Updated last year
- ☆52 Updated 5 months ago
- ☆19 Updated last year
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e… ☆26 Updated 9 months ago
- Enjoy puzzle-solving directly in your browser. ☆23 Updated 2 months ago
- ☆26 Updated 11 months ago
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network" ☆20 Updated 9 months ago
- we got you bro ☆35 Updated 7 months ago
- Harmonic Datasets ☆36 Updated 8 months ago
- ☆123 Updated last month
- Measuring the situational awareness of language models ☆34 Updated last year
- ☆32 Updated 2 weeks ago
- Understanding how features learned by neural networks evolve throughout training ☆33 Updated 4 months ago
- ☆26 Updated last year
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search. ☆31 Updated 2 weeks ago
- Improving Steering Vectors by Targeting Sparse Autoencoder Features ☆15 Updated 3 months ago
- ☆60 Updated last month
- Factored Cognition Primer: How to write compositional language model programs ☆48 Updated 2 years ago
- A TinyStories LM with SAEs and transcoders ☆11 Updated 2 months ago
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks. ☆26 Updated 6 months ago
- Code for minimum-entropy coupling. ☆31 Updated 8 months ago
- gzip Predicts Data-dependent Scaling Laws ☆34 Updated 9 months ago
- Situational Awareness Dataset ☆25 Updated 2 months ago