jianzhnie / RLZero
A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.
☆16Updated 7 months ago
Alternatives and similar repositories for RLZero
Users that are interested in RLZero are comparing it to the libraries listed below
Sorting:
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆27Updated 2 years ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆64Updated last year
- Benchmarks for Multi-Objective Multi-Agent Decision Making☆91Updated 2 months ago
- Deep Reinforcement Learning Framework done with PyTorch☆36Updated 2 months ago
- An unofficial implementation for online decision transformer☆40Updated 2 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆10Updated last year
- Official repository of Action-Free Guide☆11Updated 2 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆12Updated 2 years ago
- Drop-in environment replacements that make your RL algorithm train faster.☆20Updated 10 months ago
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆48Updated last year
- A C++ pytorch implementation of MuZero☆38Updated last year
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆82Updated 9 months ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆41Updated 2 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆22Updated 4 years ago
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆28Updated 3 weeks ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆27Updated 10 months ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆20Updated 2 years ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆59Updated 6 months ago
- On-Policy Policy Gradient Algorithms in JAX☆34Updated last year
- Implementation of Proximal Policy Optimization in Jax+Flax☆19Updated 2 years ago
- Implementations of Curious Replay for model-based adaptation.☆40Updated last year
- Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces☆40Updated last year
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆91Updated last month
- PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning. AAMAS 2024 (full paper with oral presenta…☆10Updated last year
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated 2 years ago
- Benchmarked implementations of Offline RL Algorithms.☆72Updated 2 months ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Updated 4 years ago
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite☆39Updated 2 years ago
- [ICLR 2025 Oral] OptionZero: A method for autonomously discovering and utilizing options in the MuZero algorithm☆14Updated 3 months ago
- Distributional Successor Features Enable Zero-Shot Policy Optimization☆12Updated last month