jianzhnie / RLZero
A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.
☆14Updated 5 months ago
Alternatives and similar repositories for RLZero:
Users that are interested in RLZero are comparing it to the libraries listed below
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆27Updated 2 years ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆64Updated last year
- Benchmarks for Multi-Objective Multi-Agent Decision Making☆85Updated 2 weeks ago
- An unofficial implementation for online decision transformer☆40Updated 2 years ago
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆45Updated last year
- A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.☆13Updated 4 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆12Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆9Updated last year
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆80Updated last month
- ☆20Updated 9 months ago
- Combining Evolutionary Algorithms and deep Reinforcement Learning☆15Updated 6 years ago
- Official repository of Action-Free Guide☆11Updated 2 years ago
- Bayesian Reward Shaping Framework for Deep Reinforcement Learning☆23Updated 6 years ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆19Updated 2 years ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆25Updated 9 months ago
- ☆9Updated 4 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆22Updated 3 years ago
- Sample-Efficient Automated Deep Reinforcement Learning☆34Updated 4 years ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated 2 years ago
- A web based platform for collecting human actions in reinforcement learning environments☆28Updated last year
- A2C is a special case of PPO!☆19Updated 2 years ago
- A C++ pytorch implementation of MuZero☆36Updated 11 months ago
- Gym wrapper for pysc2☆10Updated 2 years ago
- fast + parallel AlphaZero in JAX☆94Updated 3 months ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆38Updated 4 years ago
- Deep Reinforcement Learning Framework done with PyTorch☆34Updated 2 weeks ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆21Updated 4 years ago
- Official repository for the paper "Goal-Conditioned Generators of Deep Policies"☆11Updated 2 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆40Updated 2 years ago
- Understanding RL vision Distill article☆23Updated 2 years ago