understanding-search / maze-transformer
This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks.
☆24 Updated 3 weeks ago
Related projects:
- maze datasets for investigating OOD behavior of ML systems ☆14 Updated last week
- Interpreting how transformers simulate agents performing RL tasks ☆62 Updated 10 months ago
- ☆11 Updated 2 months ago
- An Open-Ended Agentic Simulator ☆17 Updated last month
- Scaling scaling laws with board games. ☆36 Updated last year
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets" ☆55 Updated 2 years ago
- ☆25 Updated last week
- Universal Neurons in GPT2 Language Models ☆25 Updated 3 months ago
- General Modules for JAX ☆57 Updated last month
- Generative cellular automaton-like learning environments for RL. ☆19 Updated last month
- ☆14 Updated last month
- ☆56 Updated 3 weeks ago
- PyTorch Package For Quasimetric Learning ☆38 Updated last year
- ☆15 Updated 2 years ago
- Meta-learning inductive biases in the form of useful conserved quantities. ☆37 Updated last year
- Minimal but scalable implementation of large language models in JAX ☆17 Updated 3 weeks ago
- Sparse Autoencoder Training Library ☆18 Updated last month
- ☆23 Updated last year
- Mechanistic Interpretability for Transformer Models ☆48 Updated 2 years ago
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code ☆20 Updated 2 weeks ago
- ☆17 Updated 3 months ago
- ☆10 Updated last year
- ☆52 Updated 8 months ago
- ☆18 Updated last year
- A videogame made with PyGame turned into an Open AI Gym Learning Environment for Reinforcement Learning agents. ☆14 Updated last year
- A collection of meta-learning algorithms in Jax ☆23 Updated 2 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 Updated 3 months ago
- Code for minimum-entropy coupling. ☆29 Updated 2 months ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy ☆13 Updated 2 months ago
- Language-annotated Abstraction and Reasoning Corpus ☆76 Updated last year