tigert1998 / rl-gobangLinks
AlphaZero implementation on Gomoku
☆18Updated 11 months ago
Alternatives and similar repositories for rl-gobang
Users that are interested in rl-gobang are comparing it to the libraries listed below
Sorting:
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Updated 3 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆25Updated 3 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆218Updated 11 months ago
- This is the source code of Agar.io environment.☆25Updated 4 years ago
- ☆25Updated 3 years ago
- A set of competitive environments for Reinforcement Learning research.☆29Updated 3 years ago
- Code for the paper "A Boolean Task Algebra For Reinforcement Learning"☆11Updated 3 years ago
- Learning-based agent for Google Research Football (足球游戏智能体)☆123Updated 2 years ago
- RLA is a tool for managing your RL experiments automatically☆72Updated 3 years ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)☆25Updated 2 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆122Updated 4 years ago
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.☆139Updated last year
- ☆16Updated 7 years ago
- A python module designed for agile RL algorithm developing.☆26Updated last year
- Chinese Standard Mahjong Competition hosted by AILab in Peking University.☆118Updated 3 years ago
- Code for NeurIPS 2021 paper "Offline Reinforcement Learning with Reverse Model-based Imagination"☆19Updated 4 years ago
- A simple package to allow users to run Monte Carlo Tree Search on any perfect information domain☆237Updated last year
- [NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks☆198Updated last year
- GPU cluster kubernetes configurations and usages☆34Updated 4 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆57Updated last year
- ☆12Updated 3 years ago
- A list of papers regarding generalization in (deep) reinforcement learning☆153Updated 2 years ago
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Updated 10 months ago
- Critic Guided Segmentation of Rewarding Objects in First-Person Views. Explanatory video:☆13Updated 3 years ago
- Code accompanying paper, Forward Prediction for Physical Reasoning☆11Updated 4 years ago
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆47Updated last year
- ☆15Updated 2 years ago
- A beamer template for LAMDA lab at NJU☆16Updated 5 years ago
- Soft Actor-Critic☆156Updated 7 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Updated 3 years ago