HumanCompatibleAI / leela-interp
Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"
☆15Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for leela-interp
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks.☆24Updated 2 months ago
- Code for minimum-entropy coupling.☆29Updated 4 months ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆20Updated 3 months ago
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)☆13Updated 6 months ago
- ☆44Updated last month
- Sparse Autoencoder Training Library