wulfebw / muzero
A Python implementation of tabular MuZero for educational purposes
☆21 Updated 5 years ago
Alternatives and similar repositories for muzero:
Users who are interested in muzero are comparing it to the repositories listed below:
- ☆67 Updated 3 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals". ☆61 Updated last year
- ☆44 Updated 6 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments ☆51 Updated 6 months ago
- Reinforcement Learning Assembly ☆92 Updated 3 years ago
- A simple implementation of the MuZero algorithm for the Connect4 game ☆97 Updated 4 years ago
- PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098) ☆94 Updated 6 years ago
- Implementation of DeepMind's Neural Episodic Control ☆58 Updated 6 years ago
- A3C-style Option-Critic with deliberation cost ☆39 Updated 7 years ago
- Hierarchical deep reinforcement learning algorithms ☆41 Updated 7 years ago
- Code for 'The Grand Atari Challenge dataset' paper ☆52 Updated 7 years ago
- ☆35 Updated 6 years ago
- Code for experimenting with state and action abstractions in reinforcement learning. ☆30 Updated 4 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning ☆27 Updated 6 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning. ☆50 Updated 6 years ago
- MAP-Elites based on Evolution Strategies ☆31 Updated 3 years ago
- Hindsight Experience Replay - Bit flipping experiment in TensorFlow ☆58 Updated 6 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber. ☆77 Updated 4 years ago
- Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial. ☆38 Updated 2 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments ☆45 Updated 4 years ago
- A TensorFlow implementation of the Option-Critic Architecture ☆71 Updated 7 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch ☆49 Updated 6 years ago
- Simple tools for statistical analyses in RL experiments ☆66 Updated 6 years ago
- ☆92 Updated 4 years ago
- Revisiting Rainbow ☆74 Updated 3 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction" ☆44 Updated last year
- Convert DeepMind Control Suite to OpenAI gym environments. ☆83 Updated 5 years ago
- State-of-the-art deep RL algorithms for Montezuma's Revenge ☆25 Updated 6 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a… ☆21 Updated 4 years ago
- Implementation of "Fast Efficient Hyperparameter Tuning for Policy Gradient Methods" (https://arxiv.org/abs/1902.06583) ☆18 Updated 5 years ago