gingkg / AlphaZero_Gomoku_PyTorchLinks
基于Pytorch, 使用强化学习(自博弈+MCTS)训练一个五子棋AI
☆28Updated 4 years ago
Alternatives and similar repositories for AlphaZero_Gomoku_PyTorch
Users that are interested in AlphaZero_Gomoku_PyTorch are comparing it to the libraries listed below
Sorting:
- 基于gym的pytorch深度强化学习(DRL)(PPO,PPG,DQN,SAC,DDPG,TD3等算法)☆137Updated 4 months ago
- ☆409Updated last year
- ☆249Updated 2 weeks ago
- 《动手学强化学习》练习代码(Pytorch)☆18Updated 3 years ago
- PPO, DDPG, SAC implementation on mujoco environment☆121Updated 3 years ago
- Honor of Kings AI Open Environment of Tencent☆778Updated last year
- This repository collects some codes that encapsulates commonly used algorithms in the field of machine learning. Most of them are based o…☆599Updated 6 months ago
- 强化学习第二版习题解答与代码案例 Solutions and codes for Reinforcement Learning second edition☆162Updated 4 years ago
- A curated list of visual reinforcement learning resources☆441Updated 2 weeks ago
- ☆106Updated 4 months ago
- ☆49Updated 7 months ago
- 动手学强化学习代码☆65Updated last year
- basic algorithms of reinforcement learning☆215Updated 2 years ago
- 本项目主要是采用蒙特卡洛搜索树与残差神经网络实现的一个可在小规模硬 件设施上短期训练一个拥有较强棋力的五子棋 AI。参考 AlphaGo Zero 原始论文 《Mastering the game of Go without human knowledge》实现的一个在五子…☆49Updated 3 years ago
- rl-papers☆48Updated 2 years ago
- ☆68Updated last year
- Implement PPO algorithm on mujoco environment,such as Ant-v2, Humanoid-v2, Hopper-v2, Halfcheeth-v2.☆55Updated 5 years ago
- 机器人走迷宫,Pytorch,强化学习,DQN。☆98Updated 4 years ago
- NeurIPS 2024 DACER☆152Updated 2 months ago
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models☆50Updated 8 months ago
- An easier PyTorch deep reinforcement learning library.☆244Updated 11 months ago
- 不围棋AI☆30Updated 3 years ago
- ☆95Updated this week
- ☆55Updated 6 months ago
- ☆90Updated 3 years ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆175Updated 2 years ago
- OpenAI团队的深度强化学习教程中文版☆31Updated 5 years ago
- 深度强化学习各算法介绍与Pytorch实现☆73Updated last year
- The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta…☆156Updated last year
- 腾讯开悟智能体比赛(王者荣耀AI比赛,稳定版)☆61Updated 2 weeks ago