tigert1998 / rl-gobangLinks
AlphaZero implementation on Gomoku
☆18Updated 6 months ago
Alternatives and similar repositories for rl-gobang
Users that are interested in rl-gobang are comparing it to the libraries listed below
Sorting:
- A tiny re-implementation of AlphaGo Zero (in Gomoku)☆76Updated 7 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆210Updated 6 months ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆120Updated 4 years ago
- A beamer template for LAMDA lab at NJU☆14Updated 4 years ago
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.☆124Updated 11 months ago
- ☆16Updated 7 years ago
- This is the source code of Agar.io environment.☆24Updated 3 years ago
- Learning-based agent for Google Research Football (足球游戏智能体)☆121Updated 2 years ago
- An RL-Friendly Vision-Language Model for Minecraft☆36Updated 10 months ago
- source code for AAMAS 2023 Imperfect-information Card Game Competition☆13Updated last year
- Implementation of Deep Reinforcement Learning Benchmark Algorithms, including DQN, Double DQN, Dueling DQN, Reinforce, Actor-Critic, A2C,…☆17Updated 3 years ago
- RLA is a tool for managing your RL experiments automatically☆71Updated 2 years ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)☆25Updated last year
- A list of papers regarding generalization in (deep) reinforcement learning☆152Updated 2 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Updated 3 years ago
- A simple package to allow users to run Monte Carlo Tree Search on any perfect information domain☆232Updated last year
- ☆145Updated 8 months ago
- Chinese Standard Mahjong Competition hosted by AILab in Peking University.☆113Updated 3 years ago
- ZJU standard C Compiler☆11Updated 8 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆130Updated 2 years ago
- ☆25Updated 3 years ago
- A python module designed for agile RL algorithm developing.☆26Updated last year
- Sokoban environment for OpenAI Gym☆378Updated last year
- A large-scale multi-modal pre-trained model☆132Updated 2 years ago
- DQN with pytorch with on Breakout and SpaceInvaders☆25Updated 6 years ago
- ☆89Updated 2 years ago
- DMControl Generalization Benchmark☆175Updated last year
- Baselines and Datasets for Pokémon Showdown RL☆58Updated this week
- Deep Learning big homework of UCAS☆37Updated 6 years ago
- GPU cluster kubernetes configurations and usages☆34Updated 3 years ago