HumanCompatibleAI / leela-interp
Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"
☆15 Updated 5 months ago
Related projects
Alternatives and complementary repositories for leela-interp
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks. ☆24 Updated 2 months ago
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind) ☆14 Updated 7 months ago
- Minimal but scalable implementation of large language models in JAX ☆26 Updated 2 weeks ago
- ☆17 Updated 5 months ago
- Scaling scaling laws with board games. ☆43 Updated last year
- Open source replication of Anthropic's Crosscoders for Model Diffing ☆16 Updated 3 weeks ago
- ☆13 Updated 4 months ago
- ☆18 Updated 7 months ago
- ☆26 Updated last year
- ☆44 Updated last month
- A library for efficient patching and automatic circuit discovery. ☆31 Updated last month
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy ☆14 Updated 3 weeks ago
- Code for minimum-entropy coupling. ☆30 Updated 4 months ago
- ☆25 Updated 3 weeks ago
- Sparse Autoencoder Training Library ☆27 Updated 3 weeks ago
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code ☆10 Updated last year
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e… ☆25 Updated 5 months ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks ☆21 Updated 3 weeks ago
- Sparse and discrete interpretability tool for neural networks ☆55 Updated 9 months ago
- An Open-Ended Agentic Simulator ☆28 Updated 3 months ago
- Generative cellular automaton-like learning environments for RL. ☆19 Updated last month
- ☆17 Updated 10 months ago
- ☆44 Updated this week
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments. ☆20 Updated 3 months ago
- Experiments with representation engineering ☆10 Updated 8 months ago
- ☆18 Updated last year
- ☆50 Updated 6 months ago
- ☆26 Updated 2 months ago
- ☆20 Updated last week