1310183534 / DouDiZhu
☆12Updated 3 years ago
Alternatives and similar repositories for DouDiZhu:
Users that are interested in DouDiZhu are comparing it to the libraries listed below
- ☆38Updated 2 years ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆98Updated 2 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆17Updated 3 years ago
- Multi-Agent RL Environment for the Stratego Board Game (and variants)☆32Updated last year
- Scalable Implementation of Neural Fictitous Self-Play☆74Updated 5 years ago
- ☆13Updated 2 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆10Updated 6 years ago
- ☆20Updated 2 years ago
- Code for "AutoCFR: Learning to Design Counterfatual Regret Minimization Algorithms", AAAI 2022 (Oral)☆16Updated 8 months ago
- ☆12Updated 2 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆46Updated 6 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆37Updated 3 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆46Updated 4 months ago
- Implementation of the Off Belief Learning algorithm.☆46Updated 2 years ago
- The Arcade Learning Environment (ALE) -- a platform for AI research.☆22Updated 4 months ago
- Code for "Joint Policy Search for Collaborative Multi-agent Incomplete Information Games"☆50Updated last year
- ☆9Updated 5 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 7 years ago
- Efficient Reinforcement Learning with a Thought-Game for StarCraft☆46Updated 2 years ago
- ☆139Updated last month
- Fictitious Self-play & Reinforcement Learning☆19Updated 6 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆19Updated 2 years ago
- RL environment replicating the werewolf game to study emergent communication☆19Updated last year
- ☆18Updated 3 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆113Updated 5 months ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆44Updated 2 years ago
- ☆12Updated 3 years ago
- Code for magnetic mirror descent.☆15Updated last year
- This code is based on the implementation of http://www.cs.cmu.edu/afs/cs/Web/People/sandholm/potential-aware_imperfect-recall.aaai14.pdf,…☆34Updated 6 years ago