heiner / nle
The NetHack Learning Environment
☆42Updated this week
Related projects: ⓘ
- Nethack Learning Environment Wrapper for Language Interface☆33Updated last year
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆189Updated 2 weeks ago
- ☆56Updated 3 weeks ago
- General Modules for JAX☆57Updated last month
- Simple single-file baselines for Q-Learning in pure-GPU setting☆87Updated last month
- An implementation of MuZero in JAX.☆52Updated last year
- An Open-Ended Agentic Simulator☆17Updated last month
- ☆59Updated last month
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆102Updated 3 weeks ago
- PAIRED in PyTorch 🔥☆56Updated last year
- Accelerated minigrid environments with JAX☆102Updated last month
- Evaluating long-term memory of reinforcement learning algorithms☆129Updated last year
- ☆25Updated last week
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX☆46Updated 10 months ago
- ☆21Updated 2 years ago
- ☆141Updated 2 weeks ago
- Baselines for gymnax 🤖☆57Updated last year
- The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge☆54Updated last year
- Efficient baselines for autocurricula in JAX.☆165Updated 3 weeks ago
- Accelerated replay buffers in JAX☆39Updated 2 years ago
- JAX implementations of core Deep RL algorithms☆79Updated 2 years ago
- ☆46Updated last year
- ☆14Updated last month
- Vectorization techniques for fast population-based training.☆52Updated 2 years ago
- Learning diverse options through the Laplacian representation.☆22Updated 8 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆49Updated 2 years ago
- Standard interface for entity based reinforcement learning environments.☆35Updated 6 months ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆10Updated 2 months ago
- Object Centric Atari games☆43Updated this week
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago