JimOhman / model-based-rl
Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).
☆29Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for model-based-rl
- Baseline implementation of recurrent PPO using truncated BPTT☆125Updated 6 months ago
- A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartP…☆106Updated 9 months ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆50Updated last week
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆170Updated last year
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆58Updated last year
- PyTorch implementation of FQF, IQN and QR-DQN.☆161Updated 3 months ago
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆72Updated 2 years ago
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆81Updated last year
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆99Updated 2 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆153Updated 5 months ago
- Datasets with baselines for offline multi-agent reinforcement learning.☆144Updated 2 weeks ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆94Updated 4 years ago
- ☆286Updated last year
- Benchmarking RL generalization in an interpretable way.☆133Updated 9 months ago
- Prioritized Experience Replay implementation with proportional prioritization☆69Updated last year
- An Implementation of Recurrent Experience Replay in Distributed Reinforcement Learning (Kapturowski et al. 2019) in PyTorch☆45Updated 2 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆133Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆105Updated 3 months ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆86Updated 3 weeks ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆133Updated 3 months ago
- Deep Reinforcement Learning codes for study. Currently, there are only codes for algorithms: DQN, C51, QR-DQN, IQN, QUOTA.☆204Updated last year
- PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF…☆29Updated 4 years ago
- A collection of RL algorithms written in JAX.☆95Updated 2 years ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆208Updated last month
- Pytorch version of Dreamer, which follows the original TF v2 codes.☆114Updated 2 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆156Updated 2 years ago
- Implementation of Trajectory Transformer with attention caching and batched beam search☆107Updated last year
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game☆39Updated last year
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆46Updated 2 months ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆148Updated 4 months ago