charleschen003 / doudizhu-rl
强化学习训练斗地主 / doudizhu AI using reinforcement learning.
☆15Updated 5 years ago
Alternatives and similar repositories for doudizhu-rl:
Users that are interested in doudizhu-rl are comparing it to the libraries listed below
- A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.☆164Updated 5 years ago
- C++/python fight the lord with pybind11 (强化学习AI斗地主), Accepted to AIIDE-2020☆158Updated 3 years ago
- [NeurIPS 2022] PerfectDou: Dominating DouDizhu with Perfect Information Distillation☆166Updated 8 months ago
- Multi-Agent Deep Deterministic Policy Gradient implementation with pytorch☆10Updated 4 years ago
- 基于RLCard平台的麻将mahjong博弈游戏代码,包括基于规则和基于Dueling DQN的Agent模型。☆28Updated 2 years ago
- Douzero with ResNet and GPU support for Windows☆36Updated 3 years ago
- A Doudizhu reinforcement learning AI☆14Updated 2 weeks ago
- Multiagent Reinforcement Learning Research Project☆127Updated 3 months ago
- notes☆26Updated 2 years ago
- pytorch实现的一些MARL算法☆65Updated 3 years ago
- The implement of the policy gradient RL algorithm with pytorch☆37Updated 4 years ago
- An unoffical implementation of AlphaHoldem. 1v1 nl-holdem AI.☆73Updated last year
- A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.☆98Updated last year
- 多智能体强化学习(MARL)算法复现,包括QMIX,VDN,QTRAN、MAVEN等等☆184Updated 2 years ago
- ☆70Updated 11 months ago
- ☆20Updated 2 years ago
- ☆38Updated 2 years ago
- ☆159Updated last year
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment☆112Updated 2 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆26Updated 2 years ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆140Updated last year
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆191Updated 4 years ago
- Learning-based agent for Google Research Football (足球游戏智能体)☆111Updated last year
- Solve BipedalWalkerHardcore-v2 with TD3☆83Updated last year
- A collection of multi agent environments based on OpenAI gym.☆21Updated last year
- 强化学习中纳什Qlearning 实现矩阵博弈☆29Updated 5 years ago
- Code for Weighted QMIX☆126Updated 4 years ago
- ☆18Updated 3 years ago
- (JAIR'2022) A mini-scale reproduction code of the AlphaStar program. Note: the original AlphaStar is the AI proposed by DeepMind to play …☆322Updated 2 years ago