1310183534 / DouDiZhu
☆13Updated 3 years ago
Alternatives and similar repositories for DouDiZhu
Users that are interested in DouDiZhu are comparing it to the libraries listed below
Sorting:
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆16Updated 4 years ago
- ☆45Updated 2 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆50Updated 8 months ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆39Updated 3 years ago
- ☆18Updated 3 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆78Updated 6 years ago
- Code for "AutoCFR: Learning to Design Counterfatual Regret Minimization Algorithms", AAAI 2022 (Oral)☆18Updated last year
- ☆21Updated 2 years ago
- ☆12Updated 2 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆45Updated 6 years ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆101Updated 2 years ago
- Implementation of the Off Belief Learning algorithm.☆47Updated 2 years ago
- ☆11Updated 4 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆20Updated 2 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 5 years ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆28Updated 3 years ago
- Fictitious Self-play & Reinforcement Learning☆18Updated 7 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆10Updated 6 years ago
- ☆30Updated 2 years ago
- ☆9Updated 6 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆13Updated last year
- Code for magnetic mirror descent.☆16Updated last year
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Updated 2 years ago
- Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning☆17Updated 2 years ago
- ☆9Updated 3 years ago
- Simple implementation of regret matching algorithm for RPS nash equilibrium computation via self-play☆25Updated 6 years ago
- RL environment replicating the werewolf game to study emergent communication☆19Updated last year
- ☆13Updated 2 years ago
- ☆143Updated 5 months ago
- ☆18Updated 6 years ago