samvelyan / minihack
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
☆16Updated 2 months ago
Alternatives and similar repositories for minihack
Users that are interested in minihack are comparing it to the libraries listed below
Sorting:
- An Open-Ended Agentic Simulator☆48Updated 9 months ago
- ☆19Updated 3 months ago
- Synchronized Curriculum Learning for RL Agents☆45Updated last month
- Official implementation of the δ-model presented in the ICML 2024 paper "A Distributional Analogue to the Successor Representation".☆20Updated 6 months ago
- ☆45Updated last year
- Learning diverse options through the Laplacian representation.☆23Updated last year
- Nethack Learning Environment Wrapper for Language Interface☆37Updated last year
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆14Updated 2 years ago
- ☆77Updated last month
- SocialJax: sequential social dilemma environments☆27Updated this week
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆48Updated 10 months ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆17Updated 6 months ago
- Unified Implementations of Offline Reinforcement Learning Algorithms☆67Updated 2 weeks ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆19Updated last year
- Scalable Opponent Shaping Experiments in JAX☆24Updated last year
- Implementation (TensorFlow/keras) of the DreamerV3 model-based RL algorithm by Hafner et al. 2023☆3Updated last year
- Object Centric Atari games☆76Updated last week
- A collection of matrix games in JAX☆11Updated 5 months ago
- Code for the papers Hypernetworks in Meta-Reinforcement Learning (Beck et al., 2022) and Recurrent Hypernetworks are Surprisingly Strong …☆12Updated 9 months ago
- A tool for aggregating and plotting MARL experiment data.☆77Updated 3 months ago
- Simple JAX Graphics Library.☆36Updated 6 months ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆23Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆112Updated 8 months ago
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆51Updated 2 years ago
- Python library for easily making web Apps to compare humans and AI☆26Updated last week
- Drop-in environment replacements that make your RL algorithm train faster.☆20Updated 10 months ago
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX☆57Updated last year
- Evaluating long-term memory of reinforcement learning algorithms☆142Updated last year
- Code for Model-Free Opponent Shaping (ICML 2022)☆18Updated 2 years ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated last year