ssokota / mmd
Code for magnetic mirror descent.
☆16 Updated last year
Alternatives and similar repositories for mmd:
Users who are interested in mmd are comparing it to the libraries listed below.
- Code for Model-Free Opponent Shaping (ICML 2022) ☆18 Updated 2 years ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon" ☆45 Updated 8 months ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works ☆17 Updated 4 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy ☆17 Updated 4 months ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games ☆38 Updated 3 years ago
- Scalable Opponent Shaping Experiments in JAX ☆24 Updated 11 months ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆100 Updated 2 years ago
- ☆30 Updated 4 years ago
- Mirror Descent Policy Optimization ☆38 Updated 4 years ago
- A collection of matrix games in JAX ☆10 Updated 3 months ago
- Conservative Q learning in Jax ☆53 Updated 2 years ago
- ☆17 Updated 2 years ago
- Vectorization techniques for fast population-based training. ☆55 Updated 2 years ago
- Multi-Agent RL Environment for the Stratego Board Game (and variants) ☆32 Updated last year
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch ☆48 Updated last year
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆53 Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆83 Updated 3 years ago
- Learning Laplacian Representations in Reinforcement Learning ☆17 Updated 4 years ago
- ☆9 Updated 3 years ago
- Official implementation of the δ-model presented in the ICML 2024 paper "A Distributional Analogue to the Successor Representation". ☆20 Updated 4 months ago
- Implementation of the Off Belief Learning algorithm. ☆46 Updated 2 years ago
- Code for "AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms", AAAI 2022 (Oral) ☆16 Updated 11 months ago
- ☆31 Updated 5 years ago
- Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL ☆24 Updated 2 years ago
- ☆41 Updated 3 years ago
- Standard interface for entity based reinforcement learning environments. ☆36 Updated last year
- ☆74 Updated this week
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016) ☆46 Updated 6 years ago
- Simple single file implementations of Reinforcement Learning algorithms in Julia ☆22 Updated last month