tigert1998 / rl-gobang
AlphaZero implementation on Gomoku
☆18 Updated 10 months ago
Alternatives and similar repositories for rl-gobang
Users that are interested in rl-gobang are comparing it to the libraries listed below
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku ☆216 Updated 10 months ago
- Chinese Standard Mahjong Competition hosted by AILab in Peking University. ☆117 Updated 3 years ago
- Learning-based agent for Google Research Football (football game agent) ☆123 Updated 2 years ago
- RLA is a tool for managing your RL experiments automatically ☆72 Updated 2 years ago
- A tiny re-implementation of AlphaGo Zero (in Gomoku) ☆76 Updated 7 years ago
- 2021 Spring ☆18 Updated last year
- ☆25 Updated 3 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆123 Updated 4 years ago
- A beamer template for LAMDA lab at NJU ☆16 Updated 5 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient) ☆42 Updated 3 years ago
- Launch programs on multiple hosts. (multi-host program launcher) ☆14 Updated 2 years ago
- GPU cluster kubernetes configurations and usages ☆34 Updated 4 years ago
- ICLR'22 Programmatic Reinforcement Learning ☆16 Updated 2 years ago
- A PyTorch implementation of "Importance Weighted Actor-Learner Architectures" https://arxiv.org/abs/1802.01561 ☆12 Updated 5 years ago
- This is the source code of Agar.io environment. ☆24 Updated 4 years ago
- Code for paper "Hierarchically Decoupled Imitation for Morphological Transfer" ☆17 Updated 2 years ago
- Code for Learning to Synthesize Programs as Interpretable and Generalizable Policies in NeurIPS 2021 ☆39 Updated 4 months ago
- ☆12 Updated 3 years ago
- ☆16 Updated 7 years ago
- A python module designed for agile RL algorithm developing. ☆26 Updated last year
- [NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks ☆197 Updated last year
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game. ☆137 Updated last year
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI) ☆24 Updated 2 years ago
- Code for the paper "A Boolean Task Algebra For Reinforcement Learning" ☆11 Updated 3 years ago
- Code for NeurIPS 2021 paper "Offline Reinforcement Learning with Reverse Model-based Imagination" ☆19 Updated 4 years ago
- A simple package to allow users to run Monte Carlo Tree Search on any perfect information domain ☆237 Updated last year
- A set of competitive environments for Reinforcement Learning research. ☆29 Updated 3 years ago
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms ☆21 Updated 9 months ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning". ☆53 Updated last year
- Lecture notes on probability ☆17 Updated 5 years ago