Danielhp95 / gym-kuhn-poker
Kuhn poker implemented in accordance to OpenAI gym interface
☆14Updated 5 years ago
Alternatives and similar repositories for gym-kuhn-poker:
Users that are interested in gym-kuhn-poker are comparing it to the libraries listed below
- Scalable Implementation of Neural Fictitous Self-Play☆76Updated 6 years ago
- Multi-Agent RL Environment for the Stratego Board Game (and variants)☆33Updated last year
- Code for magnetic mirror descent.☆16Updated last year
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆38Updated 3 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆20Updated 2 years ago
- A JAX Implementation of the Twin Delayed DDPG Algorithm☆34Updated 5 years ago
- Code for Model-Free Opponent Shaping (ICML 2022)☆18Updated 2 years ago
- Implementation of the Off Belief Learning algorithm.☆46Updated 2 years ago
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in Jax.☆30Updated 7 months ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆17Updated 4 years ago
- Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).☆21Updated 5 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆31Updated 3 years ago
- Vectorization techniques for fast population-based training.☆55Updated 2 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆115Updated 8 months ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆40Updated 2 years ago
- Open AI gym poker environment built using the clubs package☆26Updated last year
- Reinforcement Learning papers on exploration methods.☆20Updated 3 years ago
- AlphaZero for continuous control tasks☆23Updated 2 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆53Updated 2 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆94Updated 6 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated 2 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆21Updated 4 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆49Updated 7 months ago
- Deep Reinforcement Learning Framework done with PyTorch☆34Updated 2 weeks ago
- A collection of matrix games in JAX☆10Updated 4 months ago