Danielhp95 / gym-kuhn-pokerLinks
Kuhn poker implemented in accordance to OpenAI gym interface
☆14Updated 5 years ago
Alternatives and similar repositories for gym-kuhn-poker
Users that are interested in gym-kuhn-poker are comparing it to the libraries listed below
Sorting:
- A JAX Implementation of the Twin Delayed DDPG Algorithm☆35Updated 5 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆92Updated 4 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆83Updated 6 years ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆74Updated 2 years ago
- Multi-Agent RL Environment for the Stratego Board Game (and variants)☆34Updated last month
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆41Updated 3 years ago
- An implementation of MuZero in JAX.☆57Updated 3 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆44Updated 5 years ago
- Code and links for over 25,000 trained Atari agents☆98Updated last year
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆162Updated 3 years ago
- Vectorization techniques for fast population-based training.☆56Updated 3 years ago
- Code for the paper "Batch size invariance for policy optimization"☆53Updated 2 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆119Updated last year
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 5 years ago
- The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025☆11Updated 6 months ago
- Optim4RL is a Jax framework of learning to optimize for reinforcement learning.☆26Updated last year
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 6 years ago
- ☆31Updated 6 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆82Updated 3 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆17Updated 4 years ago
- ☆18Updated 3 years ago
- krazy grid world☆25Updated 5 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆30Updated 6 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆67Updated 5 years ago
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)☆197Updated 2 years ago
- Single Episode Policy Transfer in Reinforcement Learning☆17Updated 3 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- ☆21Updated 4 years ago
- Critic Guided Segmentation of Rewarding Objects in First-Person Views. Explanatory video:☆13Updated 3 years ago
- Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021)☆20Updated 4 years ago