aletcher / stable-opponent-shaping
PyTorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).
☆21Jan 15, 2020Updated 6 years ago
Alternatives and similar repositories for stable-opponent-shaping
Users that are interested in stable-opponent-shaping are comparing it to the libraries listed below
- A Framework for Equilibrium Learning in Sealed-Bid Auctions☆24Mar 17, 2023Updated 2 years ago
- ☆15May 15, 2021Updated 4 years ago
- Python wrapper for ACPC poker bot infrastructure☆13May 20, 2018Updated 7 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆33Oct 6, 2022Updated 3 years ago
- A colab that implements the Symplectic Gradient Adjustment optimizer from "The mechanics of n-player differentiable games"☆153Dec 6, 2018Updated 7 years ago
- Minimax Optimization, Stackelberg Games, Generative Adversarial Networks☆19Feb 14, 2020Updated 6 years ago
- Scalable Opponent Shaping Experiments in JAX☆25Apr 13, 2024Updated last year
- Machine Learning Course From Scratch☆13Jul 24, 2024Updated last year
- Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX.☆21Sep 18, 2020Updated 5 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆151Apr 13, 2023Updated 2 years ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control"☆30May 19, 2022Updated 3 years ago
- ☆30Jun 4, 2022Updated 3 years ago
- ☆33May 21, 2020Updated 5 years ago
- Clean single-file implementation of offline RL algorithms in JAX☆167Nov 24, 2025Updated 2 months ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆72Aug 18, 2016Updated 9 years ago
- BIM Profiles - Digital Information Exchange Requirements☆11Oct 21, 2020Updated 5 years ago
- Presentations of the advanced topics in optimization☆11Oct 30, 2019Updated 6 years ago
- Cognite examples and documentation for python.☆15Nov 18, 2020Updated 5 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- ☆11Dec 15, 2023Updated 2 years ago
- PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)☆38Feb 13, 2021Updated 5 years ago
- Official implementation of DynE, Dynamics-aware Embeddings for RL☆44Apr 28, 2021Updated 4 years ago
- [AAAI 2020 Oral] Low-variance Black-box Gradient Estimates for the Plackett-Luce Distribution☆39Jan 12, 2021Updated 5 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆43Dec 8, 2022Updated 3 years ago
- The source code for the ICLR'24 paper "Stabilizing Backpropagation Through Time to Learn Complex Physics"☆11May 17, 2024Updated last year
- Companies' House API☆10Oct 22, 2021Updated 4 years ago
- Expose Visual Components models through an OPC-UA server☆12Jul 18, 2018Updated 7 years ago
- Jax implementation of VIT-VQGAN☆10Jan 25, 2024Updated 2 years ago
- Small library for extracting values from graphs☆10Jun 19, 2019Updated 6 years ago
- A programming language that deduces code from tests☆30Jan 8, 2018Updated 8 years ago
- Attentional Mechanism incorporated in Asynchronous Advantage Actor Critic a3c/a2c deep mind☆10Jan 9, 2018Updated 8 years ago
- MATLAB implementation of the universal directed information estimators in Jiantao Jiao, Haim H. Permuter, Lei Zhao, Young-Han Kim, and Ts…☆11Apr 2, 2019Updated 6 years ago
- Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…☆11Dec 1, 2022Updated 3 years ago
- ☆11Jul 20, 2023Updated 2 years ago
- [ECCV 2020] Official Matlab implementation of rOSD: Toward unsupervised, multi-object discovery in large-scale image collections.☆10Nov 4, 2021Updated 4 years ago
- ☆10Jun 16, 2020Updated 5 years ago
- Python SDK for working with OSDU☆11Mar 4, 2025Updated 11 months ago
- Gym wrapper for Vizdoom environments☆12Dec 14, 2018Updated 7 years ago