submit-paper / Doudizhu_plus
☆40Updated 2 years ago
Alternatives and similar repositories for Doudizhu_plus:
Users that are interested in Doudizhu_plus are comparing it to the libraries listed below
- A Doudizhu reinforcement learning AI☆21Updated 2 months ago
- Douzero with ResNet and GPU support for Windows☆39Updated 3 years ago
- [NeurIPS 2022] PerfectDou: Dominating DouDizhu with Perfect Information Distillation☆173Updated 10 months ago
- ☆20Updated 2 years ago
- Learning-based agent for Google Research Football (football game agent)☆111Updated last year
- ☆29Updated 5 months ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆46Updated 6 years ago
- ☆39Updated last year
- ☆12Updated 3 years ago
- An unofficial implementation of AlphaHoldem. 1v1 nl-holdem AI.☆83Updated last year
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆49Updated 6 months ago
- Mahjong game code built on the RLCard platform, including rule-based and Dueling DQN-based agent models.☆30Updated 2 years ago
- ☆13Updated 2 years ago
- ☆12Updated 2 years ago
- Fictitious Self-play & Reinforcement Learning☆18Updated 7 years ago
- mcc_second_guandan☆76Updated 2 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 5 years ago
- Simple implementation of the regret matching algorithm for RPS Nash equilibrium computation via self-play☆25Updated 6 years ago
- Python and R tutorial for RLCard in Jupyter Notebook☆85Updated 3 years ago
- Mini HoK: a novel MARL benchmark based on the popular mobile game, Honor of Kings, to address limitations in existing environments such a…☆37Updated 2 weeks ago
- A platform for intelligent agent learning based on a 3D open-world FPS game developed by Inspir.AI.☆57Updated 2 years ago
- ☆18Updated 5 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆38Updated 3 years ago
- ☆18Updated 3 years ago
- Code for "AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms", AAAI 2022 (Oral)☆16Updated 11 months ago
- RLA is a tool for managing your RL experiments automatically☆71Updated 2 years ago
- A simple 2D ball collision engine.☆12Updated last year
- Cloud client for douzero training☆11Updated 3 years ago
- ☆16Updated 3 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆20Updated 2 years ago