fidel-schaposnik / muzero
Tensorflow implementation of MuZero algorithm
☆11 Updated 2 years ago
Related projects
Alternatives and complementary repositories for muzero
- A Python implementation of tabular MuZero for educational purposes ☆21 Updated 4 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm… ☆39 Updated 2 years ago
- An implementation of MuZero in JAX. ☆53 Updated 2 years ago
- A simple implementation of MuZero algorithm for connect4 game ☆95 Updated 4 years ago
- ☆48 Updated last year
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser… ☆56 Updated last year
- A project that provides help for using DeepMind's mctx on gym-style environments. ☆50 Updated 6 months ago
- General Modules for JAX ☆58 Updated 3 months ago
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop] ☆34 Updated 3 years ago
- ☆12 Updated 2 years ago
- Standard interface for entity based reinforcement learning environments. ☆36 Updated 8 months ago
- A JAX Implementation of the Twin Delayed DDPG Algorithm ☆32 Updated 4 years ago
- Reinforcement learning training framework for entity-gym environments. ☆15 Updated 7 months ago
- The source code for the gym-microrts paper. ☆42 Updated 2 years ago
- Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game. ☆19 Updated last year
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ… ☆27 Updated last year
- Pytorch Implementation of MuZero for gym environment. It supports any Discrete, Box and Box2D configuration for the action space and obse… ☆16 Updated last year
- Classic MCTS example with mctx ☆15 Updated last year
- JAX implementations of core Deep RL algorithms ☆79 Updated 2 years ago
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game ☆39 Updated last year
- An implementation of Phasic Policy Gradient, a proposed improvement of Proximal Policy Gradients, in Pytorch ☆44 Updated last week
- ☆46 Updated 6 months ago
- Baselines for gymnax 🤖 ☆59 Updated last year
- Collection of in-progress libraries for entity neural networks. ☆29 Updated 2 years ago
- Scaling scaling laws with board games. ☆40 Updated last year
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in Jax. ☆27 Updated 3 months ago
- A structured implementation of MuZero ☆206 Updated 2 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games ☆112 Updated 3 months ago
- ☆65 Updated 3 years ago
- A2C is a special case of PPO! ☆19 Updated 2 years ago