jianzhnie / RLZeroLinks
A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.
☆16Updated 8 months ago
Alternatives and similar repositories for RLZero
Users that are interested in RLZero are comparing it to the libraries listed below
Sorting:
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆27Updated 2 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆12Updated 2 years ago
- Benchmarks for Multi-Objective Multi-Agent Decision Making☆92Updated last week
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆11Updated 2 years ago
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆49Updated last year
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆87Updated 10 months ago
- Benchmarked implementations of Offline RL Algorithms.☆73Updated 3 months ago
- Official repository of Action-Free Guide☆11Updated 2 years ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆104Updated 2 months ago
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆32Updated last month
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆28Updated 11 months ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆66Updated last year
- Explore and Control with Adversarial Surprise☆10Updated 3 years ago
- Deep Reinforcement Learning Framework done with PyTorch☆36Updated 3 months ago
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Updated 3 years ago
- A C++ pytorch implementation of MuZero☆38Updated last year
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …☆16Updated last year
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆20Updated 2 years ago
- On-Policy Policy Gradient Algorithms in JAX☆37Updated last year
- A PyTorch implementation of DeepMind's MuZero agent☆35Updated last year
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆24Updated 2 months ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.