understanding-search / maze-dataset
Maze datasets for investigating out-of-distribution (OOD) behavior of ML systems
☆16 · Updated 2 months ago
Related projects
Alternatives and complementary repositories for maze-dataset
- ☆73 · Updated 4 months ago
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks. ☆24 · Updated 2 months ago
- Implements the Messenger environment and EMMA model. ☆23 · Updated last year
- ☆53 · Updated 2 weeks ago
- The source code of the paper "Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Pla… ☆77 · Updated 3 months ago
- a simple and scalable agent for training adaptive policies with sequence-based RL ☆91 · Updated this week
- ☆20 · Updated last year
- Minimal but scalable implementation of large language models in JAX ☆26 · Updated 2 weeks ago
- Official Repo of LangSuitE ☆78 · Updated 3 months ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 · Updated 6 months ago
- Interpreting how transformers simulate agents performing RL tasks ☆73 · Updated last year
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning) ☆52 · Updated last month
- A reinforcement learning environment for the IGLU 2022 at NeurIPS ☆32 · Updated last year
- Discovering and Achieving Goals via World Models, NeurIPS 2021 ☆83 · Updated 9 months ago
- Scaling scaling laws with board games. ☆43 · Updated last year
- Source code for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation" ☆25 · Updated 7 months ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model ☆40 · Updated 3 months ago
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ) ☆19 · Updated 5 months ago
- A lightweight research framework ☆21 · Updated 8 months ago
- ☆54 · Updated 3 years ago
- ☆46 · Updated 5 months ago
- ☆25 · Updated 3 weeks ago
- Foundation Policies with Hilbert Representations (ICML 2024) ☆72 · Updated 7 months ago
- Evaluating long-term memory of reinforcement learning algorithms ☆133 · Updated last year
- Efficient World Models with Context-Aware Tokenization. ICML 2024 ☆84 · Updated 2 months ago
- PyTorch Package For Quasimetric Learning ☆42 · Updated 3 weeks ago
- ☆77 · Updated 3 months ago
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024 ☆50 · Updated 5 months ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment" ☆80 · Updated last week
- Learning for effective and efficient bilevel planning ☆95 · Updated this week