Danielhp95 / gym-kuhn-poker
Kuhn poker implemented in accordance with the OpenAI Gym interface
☆14 · Updated 5 years ago
Alternatives and similar repositories for gym-kuhn-poker
Users who are interested in gym-kuhn-poker are comparing it to the libraries listed below
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games ☆118 · Updated last year
- Scalable implementation of Neural Fictitious Self-Play ☆83 · Updated 6 years ago
- Multi-Agent RL Environment for the Stratego Board Game (and variants) ☆34 · Updated 2 years ago
- Vectorization techniques for fast population-based training. ☆56 · Updated 3 years ago
- Code release for Learning with Opponent-Learning Awareness and variations. ☆149 · Updated 2 years ago
- Using RLlib and PycoLab to explore intelligent cooperative behavior in sequential social dilemmas ☆50 · Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆89 · Updated 4 years ago
- Code for Sibling Rivalry and experiments presented in associated paper ☆18 · Updated 4 months ago
- Code for the paper "Batch size invariance for policy optimization" ☆53 · Updated 2 years ago
- An implementation of MuZero in JAX. ☆56 · Updated 2 years ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning ☆102 · Updated 3 years ago
- PyTorch implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser… ☆70 · Updated last year
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t… ☆44 · Updated 4 years ago
- OpenAI Gym poker environment built using the clubs package ☆32 · Updated last year
- Code and links for over 25,000 trained Atari agents ☆98 · Updated last year
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020) ☆192 · Updated 2 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe… ☆161 · Updated 4 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm… ☆41 · Updated 2 years ago
- The source code for the gym-microrts paper. ☆42 · Updated 3 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games ☆54 · Updated last year
- A JAX Implementation of the Twin Delayed DDPG Algorithm ☆35 · Updated 5 years ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations" ☆162 · Updated 3 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals". ☆63 · Updated last year
- ☆18 · Updated 3 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire… ☆67 · Updated 5 years ago
- Reinforcement learning algorithms in RLlib ☆59 · Updated last year
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design. ☆133 · Updated last year
- Curiosity-driven Exploration by Self-supervised Prediction ☆140 · Updated 2 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020. ☆103 · Updated 5 months ago
- impact-driven-exploration ☆132 · Updated last year