wulfebw / muzeroLinks
A python implemenation of tabular MuZero for educational purposes
☆21Updated 5 years ago
Alternatives and similar repositories for muzero
Users that are interested in muzero are comparing it to the libraries listed below
Sorting:
- ☆67Updated 3 years ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Updated 11 months ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆63Updated last year
- ☆35Updated 7 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 4 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Updated 6 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Updated 6 years ago
- Model-Free-Episodic-Control implementation.☆17Updated 6 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆77Updated 4 years ago
- Implementation of Deepmind's Neural Episodic Control☆58Updated 7 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Updated 4 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated last year
- A simple implementation of MuZero algorithm for connect4 game☆96Updated 4 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆149Updated 2 years ago
- RUDDER: Return Decomposition for Delayed Rewards☆48Updated 4 years ago
- ☆44Updated 6 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆67Updated 5 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆32Updated 6 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆41Updated 2 years ago
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020☆25Updated 5 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments☆45Updated 5 years ago
- Revisiting Rainbow☆75Updated 4 years ago
- A practical step-by-step guide to applying RUDDER☆35Updated 5 years ago
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"☆84Updated 5 years ago
- C51-DDQN in Keras☆126Updated 7 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Updated 5 years ago
- krazy grid world☆25Updated 5 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 7 years ago
- Basic versions of agents from Spinning Up in Deep RL written in PyTorch☆204Updated 4 years ago