shoq / cfr
Monte Carlo Conterfactual Regret Minimization for imperfect information games
☆13Updated 5 years ago
Alternatives and similar repositories for cfr:
Users that are interested in cfr are comparing it to the libraries listed below
- Reinforcement learning algorithms to play Poker☆15Updated 3 years ago
- Fictitious Self-play & Reinforcement Learning☆18Updated 7 years ago
- Collection of game-theoretic algorithms for Poker☆29Updated 5 years ago
- This code is based on the implementation of http://www.cs.cmu.edu/afs/cs/Web/People/sandholm/potential-aware_imperfect-recall.aaai14.pdf,…☆34Updated 6 years ago
- ☆30Updated 6 years ago
- Potential-Aware Imperfect-Recall Abstraction with Earth Mover’s Distance in Imperfect-Information Games☆16Updated 5 years ago
- Counterfactual Regret Minimization (CFR) sample code in Python☆13Updated 5 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆36Updated 6 years ago
- Counterfactual regret minimization algorithm for Kuhn poker☆167Updated 6 years ago
- Counterfactual Regret Minimization☆29Updated 6 years ago
- A RNN PokerBot implementing DeepStack strategies☆54Updated 7 years ago
- Counterfactual Regret Minimization for poker games☆20Updated 5 years ago
- ☆9Updated 5 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆75Updated 6 years ago
- Implemented the CFR+ and PureCFR algorithms in Python to find the optimal strategies to 2-player extensive-form games, which was also use…☆21Updated 5 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆113Updated 6 months ago
- coms4995 Final Project Poker AI☆71Updated 6 years ago
- ☆24Updated 6 years ago
- Simple implementation of regret matching algorithm for RPS nash equilibrium computation via self-play☆25Updated 6 years ago
- Using counter factual regret minimization to computer optimal ranges of hands for each decision☆48Updated 4 years ago
- RainBow, Tensorflow☆49Updated 6 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Updated 3 years ago
- An implementation of Counterfactual Regret Minimization (CFR) via Temporal Difference (TD) learning☆22Updated 11 years ago
- OpenAI Gym No Limit Texas Hold 'em Environment for Reinforcement Learning☆161Updated 5 years ago
- Python implementation of Deepstack☆80Updated 5 years ago
- Reproducing results from DeepMind's paper on Population Based Training of Neural Networks.☆56Updated 6 years ago
- An attempt at a Python implementation of Pluribus, a No-Limits Hold'em Poker Bot☆101Updated 4 years ago
- ☆17Updated 5 years ago
- A reproduction of Alphago Zero in "Mastering the game of Go without human knowledge"☆13Updated 7 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 7 years ago