understanding-search / maze-transformerLinks
This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks.
☆31Updated last year
Alternatives and similar repositories for maze-transformer
Users that are interested in maze-transformer are comparing it to the libraries listed below
Sorting:
- Interpreting how transformers simulate agents performing RL tasks☆87Updated last year
- ☆52Updated last year
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆79Updated 3 years ago
- see github.com/understanding-search/maze-transformer☆10Updated last year
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆93Updated 5 months ago
- An Open-Ended Agentic Simulator☆52Updated last year
- ICML 2022: Learning Iterative Reasoning through Energy Minimization☆47Updated 2 years ago
- Scaling scaling laws with board games.☆53Updated 2 years ago
- Official codebase for "Sampling For Learnability", published at NeurIPS 2024☆16Updated 2 months ago
- maze datasets for investigating OOD behavior of ML systems☆53Updated last week
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆110Updated last year
- ☆39Updated 11 months ago
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions☆68Updated 11 months ago
- A TinyStories LM with SAEs and transcoders☆13Updated 4 months ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆11Updated 2 years ago
- PushWorld: A benchmark for manipulation planning with tools and movable obstacles☆83Updated last year
- PyTorch Package For Quasimetric Learning☆42Updated 9 months ago
- ☆13Updated last year
- ☆56Updated 2 years ago
- Building blocks for productive research☆59Updated 3 weeks ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆20Updated last year
- Contains JAX implementation of algorithms for inverse reinforcement learning☆73Updated last year
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).☆67Updated 8 months ago
- Simple JAX Graphics Library.☆36Updated 9 months ago
- General Modules for JAX☆67Updated 4 months ago
- Minimal but scalable implementation of large language models in JAX☆35Updated last month
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"☆25Updated last year
- ☆55Updated 9 months ago
- ☆82Updated 5 months ago