BentouAI / AlphaZero-Chain-Reaction
This work attempts to train AlphaZero agents on the game of Chain Reaction
☆24Updated 2 years ago
Alternatives and similar repositories for AlphaZero-Chain-Reaction:
Users who are interested in AlphaZero-Chain-Reaction are comparing it to the repositories listed below
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with☆211Updated 2 years ago
- Build your own Chinese chess (Xiangqi) AI with the AlphaZero algorithm☆258Updated 2 years ago
- ☆44Updated 2 years ago
- TextStarCraft2, a pure-language environment that supports LLMs playing StarCraft II☆261Updated 4 months ago
- [NeurIPS 2022] PerfectDou: Dominating DouDizhu with Perfect Information Distillation☆177Updated 11 months ago
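- Meta-Zeta is a reinforcement-learning-based Gomoku (Gobang) model, a demo mainly intended for understanding how AlphaGo Zero works, i.e. how the neural network guides MCTS in making decisions and how it learns through self-play. Source code + tutorial☆96Updated 2 years ago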
- A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games☆125Updated 6 months ago
- A fast, generalized, and modified implementation of DeepMind's distinguished AlphaZero in PyTorch.☆73Updated 4 months ago
- A student implementation of Alpha Go Zero☆280Updated 6 years ago
- An asynchronous/parallel implementation of the AlphaGo Zero algorithm for Gomoku☆204Updated last month
- A PyTorch-based Gomoku game model built on AlphaZero-style reinforcement learning and Monte Carlo Tree Search.☆166Updated 6 years ago
- ☆80Updated last year
- ☆28Updated last year
- Douzero with ResNet and GPU support for Windows☆41Updated 3 years ago
- A Massively Parallel Large Scale Self-Play Framework☆345Updated 2 years ago
- ☆40Updated last year
- Reinforcement learning and planning for Minecraft.☆177Updated last year
- A tiny re-implementation of AlphaGo Zero (in Gomoku)☆75Updated 7 years ago
- An unofficial implementation of AlphaHoldem. 1v1 NL Hold'em AI.☆87Updated last year
- RL on Super Mario Bros☆53Updated 4 years ago
- ☆61Updated 6 years ago
- This project is an implementation of AlphaStar☆199Updated last year
- ☆164Updated last year
- Train a Gomoku AI with reinforcement learning (self-play + MCTS), based on PyTorch☆24Updated 3 years ago
- Unified Reinforcement Learning Framework☆722Updated 7 months ago
- Learning-based agent for Google Research Football (football game agent)☆111Updated 2 years ago
- Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code☆585Updated 3 weeks ago
- Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clari…☆31Updated 10 months ago
- The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization☆756Updated last year
- This repo sets up an environment for playing Xiang Qi (Chinese chess) following the OpenAI Gym framework.☆37Updated 2 years ago