jianzhnie / RLZeroLinks
A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.
☆16Updated 10 months ago
Alternatives and similar repositories for RLZero
Users that are interested in RLZero are comparing it to the libraries listed below
Sorting:
- A C++ pytorch implementation of MuZero☆40Updated last year
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆70Updated last year
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆31Updated 2 months ago
- MiniZero: An AlphaZero and MuZero Training Framework☆98Updated last month
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆93Updated last year
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆12Updated 2 years ago
- ☆11Updated 4 years ago
- Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R…☆15Updated 2 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆23Updated 4 years ago
- Deep Reinforcement Learning Framework done with PyTorch☆37Updated 5 months ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Updated 4 years ago
- A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.☆14Updated 4 years ago
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆48Updated last year
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆41Updated 2 years ago
- Minimal code for A Generalist Agent☆42Updated 2 years ago
- Benchmarks for Multi-Objective Multi-Agent Decision Making☆100Updated last week
- An unofficial implementation for online decision transformer☆40Updated 2 years ago
- ☆31Updated 2 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆61Updated last year
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆41Updated last year
- ☆32Updated 5 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆110Updated last year
- Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.☆34Updated last year
- ☆24Updated last year
- This code accompanies the paper "Scalable Multi-Agent Model-Based Reinforcement Learning".☆58Updated 4 months ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆53Updated 4 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆40Updated 4 years ago
- On-Policy Policy Gradient Algorithms in JAX☆39Updated last year
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampl…☆19Updated last year