quangr / deepnash
☆13Updated 2 years ago
Alternatives and similar repositories for deepnash:
Users that are interested in deepnash are comparing it to the libraries listed below
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆37Updated 3 years ago
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game☆42Updated last year
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆59Updated last year
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆46Updated 4 months ago
- Code for "AutoCFR: Learning to Design Counterfatual Regret Minimization Algorithms", AAAI 2022 (Oral)☆16Updated 8 months ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆52Updated 2 months ago
- Code for magnetic mirror descent.☆15Updated last year
- Scalable Implementation of Neural Fictitous Self-Play☆74Updated 5 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆46Updated 6 years ago
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in Jax.☆29Updated 5 months ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆98Updated 2 years ago
- Classic MCTS example with mctx☆16Updated last year
- ☆38Updated 2 years ago
- Multi-Agent RL Environment for the Stratego Board Game (and variants)☆32Updated last year
- Code for Model-Free Opponent Shaping (ICML 2022)☆16Updated 2 years ago
- MiniZero: An AlphaZero and MuZero Training Framework☆76Updated last month
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆39Updated 2 years ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated 2 years ago
- Benchmarked implementations of Offline RL Algorithms.☆67Updated last week
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆102Updated 2 years ago
- An implementation of MuZero in JAX.☆54Updated 2 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆113Updated 5 months ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆85Updated 3 years ago
- Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning☆17Updated last year
- Neural Fictitious Self-Play in Leduc Holdem☆10Updated 6 years ago
- Code for "Joint Policy Search for Collaborative Multi-agent Incomplete Information Games"☆50Updated last year
- An unofficial implementation for online decision transformer☆39Updated 2 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆17Updated 3 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆24Updated last year
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆42Updated 6 months ago