CogitoNTNU / AlphaZero
An implementation of AlphaZero, trained to master Tic-Tac-Toe and Four in a row
☆19Updated last year
Related projects ⓘ
Alternatives and complementary repositories for AlphaZero
- A structured implementation of MuZero☆206Updated 2 years ago
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆66Updated last year
- An environment of the board game Go using OpenAI's Gym API☆168Updated 2 years ago
- Pytorch Implementation of MuZero☆343Updated last year
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆156Updated 3 years ago
- AlphaZero in JAX☆69Updated 7 months ago
- A simple implementation of MuZero algorithm for connect4 game☆95Updated 4 years ago
- PyTorch implementation of AlphaZero Connect from scratch (with results)☆82Updated 4 years ago
- ☆65Updated 3 years ago
- ☆286Updated last year
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆192Updated 2 years ago
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with☆191Updated last year
- A grid-world game engine for game AI research☆233Updated 7 months ago
- Code for the paper "Phasic Policy Gradient"☆252Updated last year
- ☆48Updated last year
- PyTorch implementation of DreamerV2 model-based RL algorithm☆209Updated last year
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆208Updated last month
- A suite of test scenarios for multi-agent reinforcement learning.☆622Updated 2 weeks ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆97Updated 2 years ago
- DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (…☆456Updated 7 months ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆157Updated 2 years ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆29Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆105Updated 3 months ago
- Scalable Implementation of Neural Fictitous Self-Play☆73Updated 5 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆161Updated 3 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆103Updated 3 months ago
- Benchmarking the Spectrum of Agent Capabilities☆389Updated 9 months ago
- Real-World RL Benchmark Suite☆347Updated 4 years ago
- ☆201Updated this week
- Sokoban environment for OpenAI Gym☆330Updated last year