kaesve / muzero
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and pit both algorithms against each other, and investigate the reliability of learned MuZero MDP models.
☆156 Updated 3 years ago
Alternatives and similar repositories for muzero:
Users who are interested in muzero are comparing it to the libraries listed below:
- PyTorch implementation of MuZero ☆348 Updated last year
- A structured implementation of MuZero ☆207 Updated 2 years ago
- A simple implementation of the MuZero algorithm for the Connect4 game ☆97 Updated 4 years ago
- ☆294 Updated last month
- ♟️ Vectorized RL game environments in JAX ☆439 Updated this week
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO ☆193 Updated 2 years ago
- An environment of the board game Go using OpenAI's Gym API ☆169 Updated 2 years ago
- PyTorch implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser… ☆60 Updated last year
- Benchmarking the Spectrum of Agent Capabilities ☆409 Updated last year
- Code for the paper "Phasic Policy Gradient" ☆259 Updated last year
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game ☆44 Updated last year
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games ☆113 Updated 6 months ago
- Modular framework for Reinforcement Learning in Python ☆171 Updated 2 years ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021. ☆882 Updated last year
- PyTorch implementation of DreamerV2 model-based RL algorithm ☆215 Updated last year
- ☆213 Updated 2 months ago
- A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartP… ☆107 Updated 11 months ago
- Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch. ☆293 Updated last year
- Implementation code for Agent57 (reinforcement learning), created for a Qiita post. ☆45 Updated last year
- A grid-world game engine for game AI research ☆238 Updated 10 months ago
- A fast, generalized, and modified implementation of DeepMind's distinguished AlphaZero in PyTorch. ☆69 Updated 2 months ago
- Dream to Control: Learning Behaviors by Latent Imagination ☆524 Updated 3 years ago
- impact-driven-exploration ☆130 Updated last year
- SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms ☆392 Updated this week
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations" ☆158 Updated 3 years ago
- PyTorch implementation of FQF, IQN and QR-DQN. ☆168 Updated 6 months ago
- AlphaZero in JAX ☆73 Updated 10 months ago
- Partially Observable Process Gym ☆175 Updated 7 months ago
- ☆66 Updated 3 years ago
- Benchmarking RL generalization in an interpretable way. ☆142 Updated this week