pikaju / muzero-g
Reference implementation for the paper titled "Improving Model-Based Reinforcement Learning with Internal State Representations through Self-Supervision".
☆12Updated 4 years ago
Alternatives and similar repositories for muzero-g:
Users that are interested in muzero-g are comparing it to the libraries listed below
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆80Updated 2 years ago
- ☆98Updated 2 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model☆150Updated 4 years ago
- ☆26Updated 2 years ago
- Hindsight policy gradients☆45Updated 5 years ago
- ☆44Updated last year
- Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines☆49Updated 2 years ago
- pytorch-implementation of Dreamer (Model-based Image RL Algorithm)☆168Updated 3 months ago
- impact-driven-exploration☆131Updated last year
- Unofficial re-implementation of "Learning Latent Dynamics for Planning from Pixels" (https://arxiv.org/abs/1811.04551 ) with PyTorch☆47Updated 4 years ago
- Unofficial Re-implementation of "Dream to Control: Learning Behaviors by Latent Imagination" (https://arxiv.org/abs/1912.01603 ) with PyT…☆30Updated 4 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning☆55Updated last year
- Benchmarking RL generalization in an interpretable way.☆154Updated last month
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆45Updated 3 years ago
- Pytorch implementation of DreamerV2: MASTERING ATARI WITH DISCRETE WORLD MODELS☆51Updated 3 years ago
- Simple maze environments using mujoco-py☆54Updated last year
- A collection of RL algorithms written in JAX.☆97Updated 2 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆142Updated last year
- Collection of reinforcement learning algorithms☆15Updated 3 years ago
- ☆42Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- ☆18Updated 2 years ago
- Repository for Iterated Relearning: The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning☆11Updated 4 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆44Updated 4 years ago
- ☆31Updated 2 years ago
- ☆53Updated 4 years ago
- Decoupled Reward-free ExplorAtion and Execution for Meta-reinforcement learning☆92Updated 2 years ago
- ☆31Updated 4 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- ☆112Updated 2 years ago