neelnanda-io / Neuroscope
Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons
☆12Updated 2 years ago
Alternatives and similar repositories for Neuroscope:
Users that are interested in Neuroscope are comparing it to the libraries listed below
- ☆18Updated 6 months ago
- A Mechanistic Interpretability Analysis of Grokking☆21Updated 2 years ago
- ☆16Updated last year
- Certified Reasoning with Language Models☆31Updated last year
- ☆26Updated 11 months ago
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"☆20Updated 9 months ago
- ☆20Updated 4 months ago
- ☆12Updated this week
- ☆34Updated 3 weeks ago
- Understanding how features learned by neural networks evolve throughout training☆33Updated 5 months ago
- ☆26Updated last year
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)☆18Updated 2 months ago
- ☆19Updated last year
- Improving Steering Vectors by Targeting Sparse Autoencoder Features☆16Updated 4 months ago
- ☆53Updated 5 months ago
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…☆26Updated 10 months ago
- ☆48Updated last year
- gzip Predicts Data-dependent Scaling Laws☆34Updated 9 months ago
- we got you bro☆35Updated 7 months ago
- Measuring the situational awareness of language models☆34Updated last year
- Minimum Description Length probing for neural network representations☆19Updated last month
- Sparse and discrete interpretability tool for neural networks☆59Updated last year
- ☆29Updated 10 months ago
- ☆61Updated 4 months ago
- Experimental LLM interface exploring new ways to use AI to improve human thinking☆15Updated 2 weeks ago
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …☆163Updated this week
- Mechanistic Interpretability for Transformer Models☆50Updated 2 years ago
- epsilon machines and transformers!☆24Updated last week
- ☆9Updated 3 months ago
- look how they massacred my boy☆63Updated 5 months ago