charleschen003 / doudizhu-rl
强化学习训练斗地主 / doudizhu AI using reinforcement learning.
☆15Updated 5 years ago
Alternatives and similar repositories for doudizhu-rl:
Users that are interested in doudizhu-rl are comparing it to the libraries listed below
- C++/python fight the lord with pybind11 (强化学习AI斗地主), Accepted to AIIDE-2020☆160Updated 3 years ago
- [NeurIPS 2022] PerfectDou: Dominating DouDizhu with Perfect Information Distillation☆173Updated 10 months ago
- 基于RLCard平台的麻将mahjong博弈游戏代码,包括基于规则和基于Dueling DQN的Agent模型。☆30Updated 2 years ago
- Douzero with ResNet and GPU support for Windows☆39Updated 3 years ago
- A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.☆164Updated 5 years ago
- ☆40Updated 2 years ago
- ☆162Updated last year
- An unoffical implementation of AlphaHoldem. 1v1 nl-holdem AI.☆83Updated last year
- A Doudizhu reinforcement learning AI☆21Updated 2 months ago
- ☆29Updated 5 months ago
- pytorch实现的一些MARL算法☆66Updated 3 years ago
- ☆20Updated 2 years ago
- 多智能体强化学习(MARL)算法复现,包括QMIX,VDN,QTRAN、MAVEN等等☆195Updated 2 years ago
- The implement of all kinds of dqn reinforcement learning with Pytorch☆93Updated 4 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆201Updated 3 weeks ago
- Learning-based agent for Google Research Football (足球游戏智能体)☆111Updated last year
- 强化学习经典算法(offline\online learning, q-learning, DQN)的实现在平衡杆游戏和几个Atari 游戏 (CartPole\Pong\Boxing\MsPacman)☆29Updated 6 years ago
- The implement of the policy gradient RL algorithm with pytorch☆38Updated 4 years ago
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment☆117Updated 2 years ago
- ☆16Updated 3 years ago
- 本论文题目为基于深度强化学习的德州扑克AI算法优化☆23Updated 4 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆75Updated 6 years ago
- Leaderboard and Visualization for RLCard☆374Updated last year
- 深度强化学习贪吃蛇游戏。拥有完整游戏环境与AI接口。(项目未完成)☆37Updated 5 years ago
- This project is implementation code of AlphaStar☆198Updated last year
- A code reimplementation of DeepMind's "Multiagent Cooperation and Competition with Deep Reinforcement Learning" with Tensorflow☆15Updated 6 years ago
- Code for Weighted QMIX☆133Updated 4 years ago
- The code for maddpg using pytorch☆166Updated 4 years ago
- ☆32Updated 4 years ago
- [NeurIPS 2022] 1st Place Solution for the 3rd Neural MMO Challenge☆29Updated 2 years ago