tigert1998 / rl-gobangLinks
AlphaZero implementation on Gomoku
☆18Updated 9 months ago
Alternatives and similar repositories for rl-gobang
Users that are interested in rl-gobang are comparing it to the libraries listed below
Sorting:
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Updated 3 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆215Updated 9 months ago
- This is the source code of Agar.io environment.☆23Updated 4 years ago
- Learning-based agent for Google Research Football (足球游戏智能体)☆121Updated 2 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆123Updated 4 years ago
- RLA is a tool for managing your RL experiments automatically☆72Updated 2 years ago
- ☆16Updated 7 years ago
- ☆25Updated 3 years ago
- Emergent collective intelligence from massive-agent cooperation and competition☆27Updated 2 years ago
- Chinese Standard Mahjong Competition hosted by AILab in Peking University.☆116Updated 3 years ago
- Code for NeurIPS 2021 paper "Offline Reinforcement Learning with Reverse Model-based Imagination"☆19Updated 3 years ago
- A simple package to allow users to run Monte Carlo Tree Search on any perfect information domain☆236Updated last year
- GPU cluster kubernetes configurations and usages☆34Updated 4 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆22Updated 3 years ago
- Code for "Joint Policy Search for Collaborative Multi-agent Incomplete Information Games"☆52Updated 2 years ago
- Launch programs on multiple hosts. (多机启动程序)☆14Updated 2 years ago
- A tiny re-implementation of AlphaGo Zero (in Gomoku)☆76Updated 7 years ago
- source code for AAMAS 2023 Imperfect-information Card Game Competition☆13Updated last year
- Code for the paper "A Boolean Task Algebra For Reinforcement Learning"☆11Updated 3 years ago
- ☆113Updated 6 years ago
- ZJU standard C Compiler☆11Updated 8 years ago
- ☆133Updated last year
- ☆14Updated 3 years ago
- This repo support auto line plot for multi-seed event file from TensorBoard☆12Updated 3 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆54Updated last year
- DMControl Generalization Benchmark☆181Updated last year
- ☆12Updated 3 years ago
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.☆134Updated last year
- A python module designed for agile RL algorithm developing.☆26Updated last year
- ☆12Updated 3 years ago