RuBP17 / AlphaDouLinks
A Doudizhu reinforcement learning AI
☆37Updated 3 months ago
Alternatives and similar repositories for AlphaDou
Users that are interested in AlphaDou are comparing it to the libraries listed below
Sorting:
- ☆45Updated 2 years ago
- [NeurIPS 2022] PerfectDou: Dominating DouDizhu with Perfect Information Distillation☆191Updated last year
- ☆39Updated 11 months ago
- Douzero with ResNet and GPU support for Windows☆43Updated 3 years ago
- An unoffical implementation of AlphaHoldem. 1v1 nl-holdem AI.☆99Updated 2 years ago
- ☆48Updated last year
- mcc_second_guandan☆88Updated 2 years ago
- ☆23Updated 3 years ago
- Learning-based agent for Google Research Football (足球游戏智能体)☆121Updated 2 years ago
- ☆50Updated 3 months ago
- Leaderboard and Visualization for RLCard☆388Updated last year
- MahjongZero: DouZero for Mahjong | 麻将AI☆31Updated 2 years ago
- 强化学习训练斗地主 / doudizhu AI using reinforcement learning.☆16Updated 5 years ago
- TextStarCraft2,a pure language env which support llms play starcraft2☆287Updated 4 months ago
- 基于RLCard平台的麻将mahjong博弈游戏代码,包括基于规则和基于Dueling DQN的Agent模型。☆30Updated 3 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆54Updated 11 months ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆21Updated 3 years ago
- ☆12Updated 2 years ago
- ☆12Updated 3 years ago
- 使用alphazero算法打造属于你自己的象棋AI☆278Updated 2 years ago
- DQN_play_sekiro☆537Updated 11 months ago
- ☆13Updated 3 years ago
- Honor of Kings AI Open Environment of Tencent☆761Updated last year
- A Deep Reinforcment Learning Aproach to Texas Holdem☆36Updated 3 years ago
- Python Fan calculator for Chinese Standard Mahjong☆23Updated 7 months ago
- A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.☆108Updated last year
- Code for "AutoCFR: Learning to Design Counterfatual Regret Minimization Algorithms", AAAI 2022 (Oral)☆22Updated last year
- This code is based on the implementation of http://www.cs.cmu.edu/afs/cs/Web/People/sandholm/potential-aware_imperfect-recall.aaai14.pdf,…☆35Updated 6 years ago
- [NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks☆191Updated last year
- ☆13Updated 3 years ago