charleschen003 / doudizhu-rl
A Dou Dizhu AI trained with reinforcement learning.
☆14 Updated 5 years ago
Related projects
Alternatives and complementary repositories for doudizhu-rl
- A Doudizhu reinforcement learning AI ☆10 Updated last month
- [NeurIPS 2022] PerfectDou: Dominating DouDizhu with Perfect Information Distillation ☆161 Updated 6 months ago
- ☆158 Updated last year
- C++/Python Fight the Landlord with pybind11 (a reinforcement learning AI for Dou Dizhu), accepted to AIIDE-2020 ☆157 Updated 3 years ago
- A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model. ☆161 Updated 5 years ago
- ☆16 Updated 2 years ago
- ☆25 Updated 2 years ago
- A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI. ☆95 Updated 10 months ago
- An implementation of AlphaStar ☆187 Updated 10 months ago
- Implementations of many sparse-reward algorithms in the Gym Fetch environment ☆82 Updated 4 years ago
- Learning-based agent for Google Research Football (football game agent) ☆111 Updated last year
- Some MARL algorithms implemented in PyTorch ☆64 Updated 3 years ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh… ☆129 Updated 10 months ago
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment ☆111 Updated last year
- The implementation of AAAI'22 paper "Multi-Agent Incentive Communication via Decentralized Teammate Modeling". ☆50 Updated 11 months ago
- Multiagent Reinforcement Learning Research Project ☆117 Updated last month
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games ☆46 Updated 2 months ago
- Translation and understanding of the Pop-art paper. ☆17 Updated 5 years ago
- Reproductions of multi-agent reinforcement learning (MARL) algorithms, including QMIX, VDN, QTRAN, MAVEN, and others ☆179 Updated 2 years ago
- Simple Reinforcement learning tutorials ☆14 Updated 5 years ago
- Play flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch ☆68 Updated 7 years ago
- Solve BipedalWalkerHardcore-v2 with TD3 ☆82 Updated last year
- An implementation of the policy gradient RL algorithm in PyTorch ☆37 Updated 3 years ago
- The [Hands-on Reinforcement Learning] series, based on PyTorch. ☆54 Updated 3 years ago
- ☆38 Updated 2 years ago
- RL projects including implementation of DQN/DDPG/MADDPG/BicNet on StarCraft II multi-agent learning environment SMAC ☆42 Updated 4 years ago
- Douzero with ResNet and GPU support for Windows ☆33 Updated 2 years ago
- ☆121 Updated 3 years ago
- My internship project at CASIA. 🤗 ☆2 Updated 5 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity. ☆84 Updated last year