JimOhman / model-based-rl
Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).
☆33Updated 3 years ago
Alternatives and similar repositories for model-based-rl
Users who are interested in model-based-rl are comparing it to the libraries listed below.
- A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartP…☆121Updated last year
- Re-implementations of SOTA RL algorithms.☆136Updated 2 years ago
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…☆285Updated 3 years ago
- Benchmarking RL generalization in an interpretable way.☆174Updated last month
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆183Updated 2 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆137Updated 4 months ago
- ☆246Updated last year
- ☆324Updated last year
- ☆202Updated 2 years ago
- Code for the paper "Phasic Policy Gradient"☆267Updated 2 years ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆63Updated last year
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game☆50Updated 2 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO☆182Updated 3 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆187Updated last year
- Partially Observable Process Gym☆211Updated 6 months ago
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆90Updated 2 years ago
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆78Updated 3 years ago
- ☆306Updated 3 years ago
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆100Updated last year
- Keeping track of RL experiments☆165Updated 3 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆54Updated last year
- ☆133Updated last year
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆75Updated last week
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆103Updated 3 years ago
- Multi Task RL Baselines☆258Updated 4 years ago
- Pytorch Implementation of MuZero☆354Updated 2 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆168Updated 4 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆158Updated last year
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆98Updated 5 years ago
- ☆359Updated 3 years ago