Danielhp95 / gym-kuhn-poker
Kuhn poker implemented in accordance with the OpenAI Gym interface
☆14 Updated 6 years ago
Alternatives and similar repositories for gym-kuhn-poker
Users who are interested in gym-kuhn-poker compare it to the libraries listed below
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆92 Updated 4 years ago
- An implementation of MuZero in JAX. ☆57 Updated 3 years ago
- Scalable Implementation of Neural Fictitious Self-Play ☆84 Updated 6 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games ☆119 Updated last year
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works ☆18 Updated 4 years ago
- Multi-Agent RL Environment for the Stratego Board Game (and variants) ☆34 Updated 3 months ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t… ☆44 Updated 5 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games ☆54 Updated last year
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm… ☆43 Updated 3 years ago
- Code release for Learning with Opponent-Learning Awareness and variations. ☆151 Updated 2 years ago
- Using RLLib and PycoLab to explore intelligent cooperative behavior in sequential social dilemmas ☆54 Updated 3 years ago
- A JAX Implementation of the Twin Delayed DDPG Algorithm ☆35 Updated 5 years ago
- Vectorization techniques for fast population-based training. ☆57 Updated 3 years ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning ☆103 Updated 3 years ago
- ☆18 Updated 3 years ago
- PyTorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser… ☆75 Updated last week
- ☆35 Updated 7 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games ☆40 Updated 4 years ago
- Code and links for over 25,000 trained Atari agents ☆98 Updated last year
- ☆60 Updated last year
- Code for Model-Free Opponent Shaping (ICML 2022) ☆20 Updated 3 years ago
- PyTorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ). ☆21 Updated 5 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining" ☆30 Updated 6 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery ☆83 Updated 3 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization ☆24 Updated 5 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization ☆32 Updated 4 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm ☆44 Updated 7 years ago
- AlphaZero for continuous control tasks ☆23 Updated 3 years ago
- MultiTask Environments for Reinforcement Learning. ☆79 Updated 3 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals". ☆64 Updated 2 years ago