Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).
☆21Jan 15, 2020Updated 6 years ago
Alternatives and similar repositories for stable-opponent-shaping
Users that are interested in stable-opponent-shaping are comparing it to the libraries listed below
Sorting:
- A Framework for Equilibrium Learning in Sealed-Bid Auctions☆24Mar 17, 2023Updated 2 years ago
- ☆15May 15, 2021Updated 4 years ago
- Python wrapper for ACPC poker bot infrastructure☆13May 20, 2018Updated 7 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆33Oct 6, 2022Updated 3 years ago
- Minimax Optimization, Stackelberg Games, Generative Adversarial Networks☆19Feb 14, 2020Updated 6 years ago
- Scalable Opponent Shaping Experiments in JAX☆25Apr 13, 2024Updated last year
- Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX.☆21Sep 18, 2020Updated 5 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Aug 21, 2018Updated 7 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆151Apr 13, 2023Updated 2 years ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control"☆30May 19, 2022Updated 3 years ago
- ☆30Jun 4, 2022Updated 3 years ago
- ☆33May 21, 2020Updated 5 years ago
- Clean single-file implementation of offline RL algorithms in JAX☆170Nov 24, 2025Updated 3 months ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆72Aug 18, 2016Updated 9 years ago
- ☆24Feb 18, 2026Updated 2 weeks ago
- Cognite examples and documentation for python.☆15Nov 18, 2020Updated 5 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- ☆14Feb 11, 2026Updated 3 weeks ago
- A rework plugin to read and inline css via @import☆23Oct 14, 2020Updated 5 years ago
- Et forsøk på å beskrive et minimum av hva en kommune kan forvente av tilgjengelige API operasjoner fra en moderne EPJ system☆14Jun 11, 2025Updated 8 months ago
- ☆11Dec 15, 2023Updated 2 years ago
- PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)☆38Feb 13, 2021Updated 5 years ago
- [AAAI 2020 Oral] Low-variance Black-box Gradient Estimates for the Plackett-Luce Distribution☆39Jan 12, 2021Updated 5 years ago
- Official implementation of DynE, Dynamics-aware Embeddings for RL☆44Apr 28, 2021Updated 4 years ago
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆43Dec 8, 2022Updated 3 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- Small library for extracting values from graphs☆10Jun 19, 2019Updated 6 years ago
- Mis proyectos de marketing aplicando AI☆11Oct 31, 2025Updated 4 months ago
- Jax implementation of VIT-VQGAN☆10Jan 25, 2024Updated 2 years ago
- Gym wrapper for Vizdoom environments☆12Dec 14, 2018Updated 7 years ago
- Offline Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits☆10Oct 21, 2024Updated last year
- The source code the for the ICLR'24 paper "Stabilizing Backpropagation Through Time to Learn Complex Physics"☆11May 17, 2024Updated last year
- Small algorithm for getting Antoine's coefficient to calculate vapor pressure from NIST web book.☆12May 30, 2021Updated 4 years ago
- Matlab scripting OpenModelica interface☆12Mar 26, 2024Updated last year
- Deep Learning papers that enlightened me☆12Dec 22, 2017Updated 8 years ago
- Development of the DEXPI group: Specifications, Tools, Documents☆15Sep 24, 2021Updated 4 years ago
- a jax benchmark for ad hoc teamwork☆19Updated this week
- Computational singular perturbation analysis library☆12Sep 11, 2025Updated 5 months ago
- Advanced_Data_Integration_Project☆11Jul 31, 2018Updated 7 years ago