blathers23 / Tougou_Hifumi_v8
爱恩斯坦棋博弈程序Tougou Hifumi v8
☆7Updated 2 years ago
Alternatives and similar repositories for Tougou_Hifumi_v8
Users that are interested in Tougou_Hifumi_v8 are comparing it to the libraries listed below
Sorting:
- NJU程设实验项目三:爱因斯坦棋☆8Updated 5 years ago
- 亚马逊棋冠军程序细节☆7Updated last month
- 爱恩斯坦棋代码☆10Updated 4 years ago
- 不围棋AI☆29Updated 2 years ago
- An environment based on JSBSIM aimed at one-to-one close air combat.☆9Updated last month
- ☆14Updated last year
- 深度强化学习贪吃蛇游戏。拥有完整游戏环境与AI接口。(项目未完成)☆37Updated 5 years ago
- pytorch实现的一些MARL算法☆65Updated 4 years ago
- Meta-Zeta是一个基于强化学习的五子棋(Gobang)模型,主要用以了解AlphaGo Zero的运行原理的Demo,即神经网络是如何指导MCTS做出决策的,以及如何自我对弈学习。源码+教程☆97Updated 2 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆50Updated 8 months ago
- Various explorations into the game of Poker using MCTS, NFSP, and image-recognition/web-scraping☆12Updated 4 years ago
- Play Atari(Breakout) Game by DRL - DQN, Noisy DQN and A3C☆13Updated 4 years ago
- 基于gym的pytorch深度强化学习(DRL)(PPO,PPG,DQN,SAC,DDPG,TD3等算法)☆105Updated last month
- 《动手学强化学习》练习代码(Pytorch)☆15Updated 2 years ago
- 2048 environment for Reinforcement Learning and DQN algorithm☆40Updated 2 years ago
- 强化学习经典算法(offline\online learning, q-learning, DQN)的实现在平衡杆游戏和几个Atari 游戏 (CartPole\Pong\Boxing\MsPacman)☆29Updated 6 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆39Updated 3 years ago
- 基于Pytorch, 使用强化学习(自博弈+MCTS)训练一个五子棋AI☆24Updated 3 years ago
- 本论文题目为基于深度强化学习的德州扑克AI算法优化☆24Updated 4 years ago
- BJTU, database training course project, Mini DBMS☆13Updated last year
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆28Updated 2 years ago
- A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.☆100Updated last year
- ☆15Updated last year
- ☆59Updated 4 years ago
- Pytorch Implementation for First Order Constrained Optimization in Policy Space (FOCOPS).☆26Updated 3 years ago
- 用深度学习+强化学习编写的一个五子棋人工智障☆41Updated 7 years ago
- ☆16Updated 2 years ago
- 根据博弈树的启发式搜索过程、设计α-β剪枝算法和评价函数开发的一个五子棋人机博弈游戏。☆18Updated 3 years ago
- Meta RL codebase for Unstable Baselines☆21Updated 2 years ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆28Updated 3 years ago