google-research / reincarnating_rl
[NeurIPS 2022] Open source code for reusing prior computational work in RL.
☆91Updated last year
Related projects: ⓘ
- Baselines for gymnax 🤖☆57Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆102Updated 3 weeks ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆87Updated last month
- Benchmarking RL generalization in an interpretable way.☆128Updated 7 months ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆76Updated 3 months ago
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆46Updated last year
- ☆56Updated 3 weeks ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆49Updated 2 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆129Updated last year
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.☆121Updated 3 weeks ago
- A collection of RL algorithms written in JAX.☆92Updated 2 years ago
- An API conversion tool for popular external reinforcement learning environments☆131Updated 3 months ago
- Deep Hierarchical Planning from Pixels☆85Updated last year
- Various reinforcement learning algorithms written in Jax + Flax☆21Updated last year
- Partially Observable Process Gym☆158Updated 2 months ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆66Updated 9 months ago
- Benchmarks for Multi-Objective Multi-Agent Decision Making☆51Updated last month
- ☆17Updated last year
- A tool for aggregating and plotting MARL experiment data.☆57Updated 3 weeks ago
- Datasets with baselines for offline multi-agent reinforcement learning.☆125Updated this week
- ☆43Updated 3 months ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆96Updated 2 years ago
- Implementation of Trajectory Transformer with attention caching and batched beam search☆101Updated last year
- JAX implementations of core Deep RL algorithms☆79Updated 2 years ago
- Vectorization techniques for fast population-based training.☆52Updated 2 years ago
- ☆100Updated 7 months ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆78Updated last year
- Contains JAX implementation of algorithms for inverse reinforcement learning☆59Updated last month
- A web based platform for collecting human actions in reinforcement learning environments☆26Updated last year
- A tool for recording RL trajectories.☆91Updated last month