baskuit / R-NaD
Experimentation with Regularized Nash Dynamics on a GPU accelerated game
☆46Updated last year
Alternatives and similar repositories for R-NaD:
Users that are interested in R-NaD are comparing it to the libraries listed below
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite☆36Updated 2 years ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆93Updated 4 months ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆56Updated 4 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆142Updated last week
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆64Updated last year
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆22Updated 3 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 7 months ago
- ☆73Updated 4 months ago
- Implementation of Truncated Quantile Critics method for continuous reinforcement learning. https://bayesgroup.github.io/tqc/☆93Updated 4 years ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆19Updated last year
- ☆41Updated last year
- Evaluating long-term memory of reinforcement learning algorithms☆141Updated last year
- Repository for the PGA-MAP-Elites algorithm. PGA-MAP-Elites was developed to efficiently scale MAP-Elites to large genotypes and noisy d…☆36Updated 3 years ago
- Clean single-file implementation of offline RL algorithms in JAX☆137Updated 2 months ago
- Author's PyTorch implementation of TD7 for online and offline RL☆137Updated last year
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆74Updated 2 years ago
- A collection of RL algorithms written in JAX.☆95Updated 2 years ago
- Pytorch version of Dreamer, which follows the original TF v2 codes.☆122Updated 3 years ago
- ☆77Updated 3 weeks ago
- Baselines for gymnax 🤖☆66Updated last year
- Partially Observable Process Gym☆183Updated 8 months ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆84Updated 2 years ago
- ☆74Updated this week
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆48Updated last year
- ☆217Updated 4 months ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆49Updated 6 months ago
- Implementation of Trajectory Transformer with attention caching and batched beam search☆110Updated last year
- Deep Hierarchical Planning from Pixels☆95Updated 2 years ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆67Updated 9 months ago