baskuit / R-NaD
Experimentation with Regularized Nash Dynamics on a GPU accelerated game
☆39Updated last year
Related projects ⓘ
Alternatives and complementary repositories for R-NaD
- A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartP…☆104Updated 8 months ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆199Updated 3 weeks ago
- Benchmarking RL generalization in an interpretable way.☆131Updated 8 months ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆79Updated last year
- ☆64Updated this week
- Evaluating long-term memory of reinforcement learning algorithms☆132Updated last year
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆56Updated last year
- An API conversion tool for popular external reinforcement learning environments☆139Updated last month
- Datasets with baselines for offline multi-agent reinforcement learning.☆137Updated this week
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆105Updated 2 months ago
- Baseline implementation of recurrent PPO using truncated BPTT☆123Updated 6 months ago
- PyTorch implementation of DreamerV2 model-based RL algorithm☆209Updated last year
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆48Updated last year
- A project that provides help for using DeepMind's mctx on gym-style environments.☆50Updated 6 months ago
- Deep Hierarchical Planning from Pixels☆90Updated last year
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆150Updated 4 months ago
- ☆200Updated 9 months ago
- A Simplified Pytorch Version of the Dreamer Algorithm☆112Updated last year
- A collection of RL algorithms written in JAX.☆94Updated 2 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆98Updated 2 years ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning☆76Updated 3 months ago
- SBX: Stable Baselines Jax (SB3 + Jax)☆336Updated this week
- Partially Observable Process Gym☆166Updated 4 months ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆50Updated 3 years ago
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆72Updated last year
- ☆147Updated 2 months ago
- A tool for aggregating and plotting MARL experiment data.☆61Updated this week
- fast + parallel AlphaZero in JAX☆84Updated 7 months ago
- Pytorch version of Dreamer, which follows the original TF v2 codes.☆113Updated 2 years ago
- Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch.☆285Updated 9 months ago