JimOhman / model-based-rl
Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).
☆29 Updated 2 years ago
Related projects
Alternatives and complementary repositories for model-based-rl
- A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartP… ☆104 Updated 8 months ago
- PyTorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser… ☆56 Updated last year
- Datasets with baselines for offline multi-agent reinforcement learning. ☆137 Updated this week
- PyTorch implementation of FQF, IQN and QR-DQN. ☆161 Updated 3 months ago
- ☆188 Updated last year
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆98 Updated 2 years ago
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,… ☆80 Updated last year
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing ☆132 Updated 3 months ago
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning" ☆171 Updated last year
- Partially Observable Process Gym ☆166 Updated 4 months ago
- Recurrent continuous reinforcement learning algorithms implemented in PyTorch. ☆50 Updated 3 years ago
- Baseline implementation of recurrent PPO using truncated BPTT ☆123 Updated 6 months ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆154 Updated 2 years ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight. ☆199 Updated 3 weeks ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations. ☆93 Updated 4 years ago
- ☆228 Updated 2 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) ☆132 Updated 5 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games ☆46 Updated 2 months ago
- ☆200 Updated 9 months ago
- Benchmarking RL generalization in an interpretable way. ☆131 Updated 8 months ago
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game ☆39 Updated last year
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm… ☆39 Updated 2 years ago
- Multi-Agent Deep Reinforcement Learning by using Asynchronous & Impala Proximal Policy Optimization in PyTorch with some explanation ☆32 Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆105 Updated 2 months ago
- Keeping track of RL experiments ☆159 Updated last year
- A project that provides help for using DeepMind's mctx on gym-style environments. ☆50 Updated 6 months ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations" ☆156 Updated 2 years ago
- JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️ ☆196 Updated 3 weeks ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO ☆191 Updated 2 years ago
- ☆285 Updated last year