menglinjian / -
本论文题目为基于深度强化学习的德州扑克AI算法优化
☆19Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for -
- lecture32_AI挑战星际争霸II(强化学习)☆16Updated 2 years ago
- 多智能体强化学习(MARL)算法复现,包括QMIX,VDN,QTRAN、MAVEN等等☆179Updated 2 years ago
- A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.☆161Updated 5 years ago
- 强化学习-中文笔记&资源-以python实例为主-由浅入深☆86Updated 3 years ago
- ☆158Updated last year
- Alignment成为GPT类大模型微调的必须环节,深度强化学习是Alignment的核心。本项目是一个支持非gym环境训练、支持可视化配置的深度强化学习应用编程框架,30分钟上手强化学习编程。☆71Updated last year
- 天授中文文档☆55Updated 2 years ago
- 该论文主要介绍了美国卡内基梅隆大学团队,在多人德州扑克上的人工智能新思路,即不再简单寻找纳什均衡,而引入悔恨值的概念,自我博弈,并采用蒙特卡洛CFR方法,构建蓝图,该方法通用性强,该团队声称他们的德州扑克蓝图只在两枚CPU运算8天即可得出蓝图,即可以实现实时博弈。现已经有国…☆25Updated 5 years ago
- [动手学强化学习]系列,基于pytorch。☆54Updated 3 years ago
- [ICLR 2023] Come & try Decision-Intelligence version of "Agar"! Gobigger could also help you with multi-agent decision intelligence stud…☆463Updated last year
- 此项目中将上传我在B站《强化学习理论基础》系列视频中的板书、参考资料等内容。☆72Updated last year
- Various explorations into the game of Poker using MCTS, NFSP, and image-recognition/web-scraping☆12Updated 4 years ago
- 《强化学习-原理与Python实现》的Pytorch实现。☆53Updated 3 years ago
- ☆16Updated 2 years ago
- 这个仓库用于存储一些强化学习练手小项目与算法实验。具体来讲,就是不至于单独成一个 repo 的项目,但是又值得拿出来讨论的代码。☆16Updated 3 years ago
- Simple Reinforcement learning tutorials☆14Updated 5 years ago
- 动手学强化学习代码☆37Updated 10 months ago
- 《深度强化学习:原理与实践》,Code of the book <Deep Reinforcement Learning: Principles and Practices>☆152Updated 5 years ago
- ☆21Updated 2 years ago
- Play flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch☆67Updated 7 years ago
- Learning Resources And Links Of Reinforcement Learning (updating)☆234Updated 3 years ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆129Updated 10 months ago
- An easier PyTorch deep reinforcement learning library.☆169Updated this week
- PyTorch implements multi-agent reinforcement learning algorithms, including QMIX, Independent PPO, Centralized PPO, Grid Wise Control, Gr…☆194Updated last year
- Reinforcement Learning Algorithms Based on PyTorch☆17Updated 2 years ago
- Tutorial for Reinforcement Learning☆172Updated 2 years ago
- An unoffical implementation of AlphaHoldem. 1v1 nl-holdem AI.☆68Updated last year
- 强化学习第二版习题解答与代码案例 Solutions and codes for Reinforcement Learning second edition☆133Updated 3 years ago
- Multiagent Reinforcement Learning Research Project☆118Updated last month
- 学习强化学习过程中的笔记和代码☆9Updated 4 years ago