understanding-search / maze-transformerLinks
This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks.
☆32Updated 2 months ago
Alternatives and similar repositories for maze-transformer
Users that are interested in maze-transformer are comparing it to the libraries listed below
Sorting:
- Interpreting how transformers simulate agents performing RL tasks☆90Updated 2 years ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆107Updated last month
- Scaling scaling laws with board games.☆53Updated 2 years ago
- ☆16Updated last year
- Code for "Baba Is AI: Break the Rules to Beat the Benchmark"☆41Updated 4 months ago
- A TinyStories LM with SAEs and transcoders☆14Updated 9 months ago
- Atari-style POMDPs☆21Updated last week
- see github.com/understanding-search/maze-transformer☆10Updated 2 years ago
- An Open-Ended Agentic Simulator☆58Updated last year
- ☆56Updated last year
- Standardized Minecraft Diamond Environment for Reinforcement Learning☆32Updated 2 years ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆21Updated 3 months ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆22Updated last year
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆17Updated last month
- Official codebase for "Sampling For Learnability", published at NeurIPS 2024☆20Updated 3 months ago
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions☆73Updated last year
- ☆19Updated 3 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆112Updated 2 years ago
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).☆72Updated last year
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"☆29Updated last year
- ☆58Updated 3 years ago
- Scalable Opponent Shaping Experiments in JAX☆25Updated last year
- Emergent world representations: Exploring a sequence model trained on a synthetic task☆198Updated 2 years ago
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆83Updated 3 years ago
- ☆57Updated last year
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆17Updated 3 years ago
- The Energy Transformer block, in JAX☆63Updated 2 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆74Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Updated 2 years ago
- Discovering Quality-Diversity Algorithms via Meta-Black-Box Optimization☆20Updated last month