menglinjian / -
This thesis is titled "Optimization of a Texas Hold'em Poker AI Algorithm Based on Deep Reinforcement Learning"
☆23Updated 4 years ago
Alternatives and similar repositories for -:
Users that are interested in - are comparing it to the libraries listed below
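- This paper presents a new approach to AI for multiplayer Texas Hold'em from a team at Carnegie Mellon University (USA): rather than simply searching for a Nash equilibrium, it introduces the concept of regret, relies on self-play, and uses Monte Carlo CFR to build a blueprint strategy. The method generalizes well; the team claims their Texas Hold'em blueprint was computed in only 8 days on two CPUs, which makes real-time play possible. There are already…☆25Updated 5 years ago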
- A Deep Reinforcement Learning Approach to Texas Holdem☆32Updated 2 years ago
- lecture32_AI takes on StarCraft II (reinforcement learning)☆17Updated 2 years ago
- An unofficial implementation of AlphaHoldem. 1v1 nl-holdem AI.☆83Updated last year
- DouDizhu AI trained with reinforcement learning.☆15Updated 5 years ago
- Douzero with ResNet and GPU support for Windows☆39Updated 3 years ago
- ☆20Updated 2 years ago
- Various explorations into the game of Poker using MCTS, NFSP, and image-recognition/web-scraping☆12Updated 4 years ago
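- Reproductions of multi-agent reinforcement learning (MARL) algorithms, including QMIX, VDN, QTRAN, MAVEN, and others☆195Updated 2 years ago
- Alignment has become an essential step in fine-tuning GPT-style large models, and deep reinforcement learning is at the core of alignment. This project is a deep reinforcement learning application programming framework that supports training in non-gym environments and visual configuration; get started with reinforcement learning programming in 30 minutes.☆72Updated 2 years ago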
- ☆40Updated 2 years ago
- Code for "AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms", AAAI 2022 (Oral)☆16Updated 11 months ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆153Updated last year
- PyTorch implementations of multi-agent reinforcement learning algorithms, including QMIX, Independent PPO, Centralized PPO, Grid Wise Control, Gr…☆216Updated last year
- Texas Hold'em client AI code for the Huawei Elite Challenge☆26Updated 8 years ago
- PyTorch implementations of multiple deep reinforcement learning algorithms (DQN, DDPG, TD3, PPO, A3C...) with OpenAI Gym☆57Updated 3 years ago
- Multi-agent learning library☆18Updated 3 years ago
- ☆162Updated last year
- notes☆27Updated 2 years ago
- Gym-based deep reinforcement learning (DRL) in PyTorch (PPO, PPG, DQN, SAC, DDPG, TD3, and other algorithms)☆91Updated this week
- [NeurIPS 2022] PerfectDou: Dominating DouDizhu with Perfect Information Distillation☆173Updated 10 months ago
- ☆16Updated 3 years ago
- Counterfactual regret minimization algorithm for Kuhn poker☆170Updated 6 years ago
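- Study notes on the deep reinforcement learning lectures by Prof. Hung-yi Lee of National Taiwan University☆140Updated 5 years ago
- Nash Q-learning in reinforcement learning, applied to matrix games☆30Updated 6 years ago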
- This code is based on the implementation of http://www.cs.cmu.edu/afs/cs/Web/People/sandholm/potential-aware_imperfect-recall.aaai14.pdf,…☆34Updated 6 years ago
- Python implementation of DeepStack☆82Updated 5 years ago
- RL algorithms☆142Updated 4 years ago
- Multi-agent reinforcement learning☆90Updated 6 years ago
- 2048 environment for Reinforcement Learning and DQN algorithm☆40Updated 2 years ago