understanding-search / maze-transformer
This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks.
☆24 Updated 2 months ago
Related projects
Alternatives and complementary repositories for maze-transformer
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network" ☆15 Updated 5 months ago
- An Open-Ended Agentic Simulator ☆28 Updated 3 months ago
- ☆13 Updated 4 months ago
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind) ☆14 Updated 7 months ago
- ☆26 Updated last year
- maze datasets for investigating OOD behavior of ML systems ☆16 Updated 2 months ago
- Scaling scaling laws with board games. ☆43 Updated last year
- Minimal but scalable implementation of large language models in JAX ☆26 Updated 2 weeks ago
- ☆17 Updated 5 months ago
- General Modules for JAX ☆58 Updated 3 months ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult" ☆12 Updated 3 weeks ago
- ☆26 Updated 2 months ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy ☆14 Updated 3 weeks ago
- Interpreting how transformers simulate agents performing RL tasks ☆73 Updated last year
- Accelerated replay buffers in JAX ☆40 Updated 2 years ago
- Simple JAX Graphics Library. ☆23 Updated 2 weeks ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper) ☆22 Updated this week
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code ☆27 Updated this week
- Generative cellular automaton-like learning environments for RL. ☆19 Updated last month
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets" ☆63 Updated 2 years ago
- ☆63 Updated 3 months ago
- Code for minimum-entropy coupling. ☆30 Updated 4 months ago
- A library for efficient patching and automatic circuit discovery. ☆31 Updated last month
- ☆10 Updated 6 months ago
- ☆18 Updated last year
- ☆15 Updated 2 years ago
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making" ☆21 Updated 4 months ago
- ☆50 Updated 6 months ago
- Tools for studying developmental interpretability in neural networks. ☆77 Updated last week
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 Updated 6 months ago