Danielhp95 / gym-kuhn-poker
Kuhn poker implemented in accordance to OpenAI gym interface
☆14Updated 5 years ago
Alternatives and similar repositories for gym-kuhn-poker:
Users that are interested in gym-kuhn-poker are comparing it to the libraries listed below
- Vectorization techniques for fast population-based training.☆56Updated 2 years ago
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in Jax.☆31Updated 9 months ago
- krazy grid world☆25Updated 5 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆78Updated 6 years ago
- A JAX Implementation of the Twin Delayed DDPG Algorithm☆34Updated 5 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆11Updated 6 years ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- Code for magnetic mirror descent.☆16Updated last year
- Open AI gym poker environment built using the clubs package☆29Updated last year
- General Modules for JAX☆64Updated last month
- Learning Action-Value Gradients in Model-based Policy Optimization☆31Updated 3 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆20Updated 2 years ago
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"☆19Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆85Updated 3 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆17Updated 6 months ago
- Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning☆17Updated 2 years ago
- PyTorch IMPALA implementation☆26Updated 5 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆44Updated 4 years ago
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- AlphaZero for continuous control tasks☆23Updated 2 years ago
- Implementation of the Off Belief Learning algorithm.☆46Updated 2 years ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆52Updated last year
- implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies☆11Updated 4 years ago
- ☆41Updated 3 years ago
- This code is based on the implementation of http://www.cs.cmu.edu/afs/cs/Web/People/sandholm/potential-aware_imperfect-recall.aaai14.pdf,…☆34Updated 6 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆16Updated 4 years ago
- Drop-in environment replacements that make your RL algorithm train faster.☆20Updated 10 months ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆39Updated 3 years ago
- Code for the paper "Batch size invariance for policy optimization"☆49Updated 2 years ago