Danielhp95 / gym-kuhn-poker
Kuhn poker implemented in accordance to OpenAI gym interface
☆11Updated 4 years ago
Related projects: ⓘ
- Multi-Agent RL Environment for the Stratego Board Game (and variants)☆29Updated last year
- Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game.☆19Updated last year
- ☆9Updated 3 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 4 years ago
- Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).☆21Updated 4 years ago
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in Jax.☆27Updated last month
- Vectorization techniques for fast population-based training.☆52Updated 2 years ago
- AlphaZero for continuous control tasks☆23Updated last year
- Code for magnetic mirror descent.☆13Updated 11 months ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Updated 4 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆16Updated 3 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Updated 4 years ago
- ☆35Updated 6 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆30Updated 4 years ago
- Model-Free-Episodic-Control implementation.☆17Updated 5 years ago
- A JAX Implementation of the Twin Delayed DDPG Algorithm☆31Updated 4 years ago
- An implementation of MuZero in JAX.☆52Updated last year
- AGAC: Adversarially Guided Actor-Critic☆47Updated 3 years ago
- Standard interface for entity based reinforcement learning environments.☆35Updated 6 months ago
- A3C style Option-Critic with deliberation cost☆39Updated 6 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- Reinforcement Learning papers on exploration methods.☆20Updated 3 years ago
- Open AI gym poker environment built using the clubs package☆26Updated 8 months ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆9Updated 5 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆17Updated 7 years ago
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"☆21Updated 4 years ago
- A PyTorch implementation of DeepMind's MCTSnet☆18Updated last year
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆34Updated 3 years ago
- Fictitious Self-play & Reinforcement Learning☆19Updated 6 years ago