blathers23 / Tougou_Hifumi_v8
EinStein würfelt nicht! game-playing program Tougou Hifumi v8
☆8 · Updated 3 years ago
Alternatives and similar repositories for Tougou_Hifumi_v8
Users who are interested in Tougou_Hifumi_v8 are comparing it to the repositories listed below
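- NJU programming course lab project 3: EinStein würfelt nicht! ☆9 · Updated 6 years ago
- ☆14 · Updated last year
- Details of a champion Game of the Amazons program ☆9 · Updated 3 months ago
- ☆36 · Updated last year
- EinStein würfelt nicht! code ☆10 · Updated 4 years ago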
- Meta RL codebase for Unstable Baselines ☆21 · Updated 2 years ago
- Benchmarked implementations of Offline Multi-Agent RL Algorithms based on PyMARL codebase. ☆27 · Updated 8 months ago
- ☆15 · Updated last year
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games ☆51 · Updated 10 months ago
- 2048 environment for Reinforcement Learning and DQN algorithm ☆40 · Updated 3 years ago
- ☆49 · Updated 2 years ago
- Various explorations into the game of Poker using MCTS, NFSP, and image-recognition/web-scraping ☆12 · Updated 4 years ago
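- This project uses Monte Carlo tree search and a residual neural network to build a Gomoku AI that can reach fairly strong play after short training on small-scale hardware. Based on the original AlphaGo Zero paper, "Mastering the game of Go without human knowledge", it implements a Gomoku… ☆44 · Updated 3 years ago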
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh… ☆158 · Updated last year
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games. ☆28 · Updated 3 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku ☆209 · Updated 4 months ago
- A PyTorch implementation of Implicit Q-Learning ☆82 · Updated 3 years ago
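- Phantom Go AI based on miniGo; champion of the Phantom Go division at the 2019 China Computer Game Competition ☆49 · Updated last year
- A PyTorch-based Gomoku game model. AlphaZero-based reinforcement learning and Monte Carlo Tree Search model. ☆165 · Updated 6 years ago
- Gomoku implemented in Java; the human-vs-computer mode includes a minimax game algorithm with alpha-beta pruning ☆13 · Updated 3 years ago
- Data structures course final project: a campus navigation system ☆12 · Updated 6 years ago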
- The implementation of AAAI 2022 paper "Multi-Agent Incentive Communication via Decentralized Teammate Modeling". ☆56 · Updated last year
- PyTorch implementation of the implicit Q-learning algorithm (IQL) ☆41 · Updated 3 years ago
- The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data". ☆43 · Updated 8 months ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms ☆155 · Updated 2 years ago
- NoGo AI ☆29 · Updated 2 years ago
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022) ☆56 · Updated last year
- MATE: the Multi-Agent Tracking Environment. ☆38 · Updated 2 years ago
- PPO, DDPG, SAC implementation on mujoco environment ☆111 · Updated 3 years ago
- ☆12 · Updated last year