Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).
☆21Jan 15, 2020Updated 6 years ago
Alternatives and similar repositories for stable-opponent-shaping
Users that are interested in stable-opponent-shaping are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Framework for Equilibrium Learning in Sealed-Bid Auctions☆24Mar 17, 2023Updated 3 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆33Oct 6, 2022Updated 3 years ago
- A colab that implements the Symplectic Gradient Adjustment optimizer from "The mechanics of n-player differentiable games"☆153Dec 6, 2018Updated 7 years ago
- Scalable Opponent Shaping Experiments in JAX☆25Apr 13, 2024Updated 2 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆151Apr 13, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Machine Learning Course From Scratch☆13Jul 24, 2024Updated last year
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆98Aug 21, 2018Updated 7 years ago
- Bayes-Nash equilibrium computation of combinatorial auctions☆14May 30, 2022Updated 3 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- Tilting estimators for program evaluation for Python 3☆10Oct 31, 2019Updated 6 years ago
- Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX.☆21Sep 18, 2020Updated 5 years ago
- ☆15May 15, 2021Updated 4 years ago
- Zeroth-order Min-max Optimization☆13Jun 28, 2020Updated 5 years ago
- Prof. S. Boyd's LaTeX Templates☆13Dec 18, 2018Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Kore 2022 episode visualizer☆10May 29, 2022Updated 3 years ago
- ☆10Apr 23, 2021Updated 4 years ago
- ☆12Jan 30, 2021Updated 5 years ago
- Source code for "Influencing Long-Term Behavior in Multiagent Reinforcement Learning" (NeurIPS 2022)☆19Jan 1, 2023Updated 3 years ago
- Code for Continual Learning of Control Primitives☆18Nov 11, 2020Updated 5 years ago
- Implementation of ``Actor-Critic Alignment for Offline-to-Online Reinforcement Learning''☆13Oct 12, 2023Updated 2 years ago
- Interpretability dashboard for reinforcement learners☆16Jun 4, 2019Updated 6 years ago
- ☆13Aug 9, 2023Updated 2 years ago
- Causal Analysis of Agent Behavior for AI Safety☆20Jun 27, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Work in progress save editor for Monster Hunter: World☆11Aug 15, 2018Updated 7 years ago
- Prototype code for paper: Adversarial Generalized Method of Moments, Greg Lewis and Vasilis Syrgkanis☆13Oct 21, 2020Updated 5 years ago
- A fast data loader for ImageNet on PyTorch.☆18Mar 17, 2019Updated 7 years ago
- SemiDefinite Programming Algorithm (SDPA) for Python☆12Jan 27, 2025Updated last year
- A programming language that deduces code from tests☆30Jan 8, 2018Updated 8 years ago
- Myriad is a real-world testbed that aims to bridge trajectory optimization and deep learning.☆67Sep 12, 2023Updated 2 years ago
- Codification used for the AAMAS-17 paper "Simultaneously Learning and Advising in Multiagent Reinforcement Learning"☆15Dec 18, 2017Updated 8 years ago
- Code for Policy Consolidation for Continual Reinforcement Learning☆10May 12, 2019Updated 6 years ago
- Unofficial and Partial Implementation of Fast AutoAugment in Pytorch☆10Oct 3, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Nov 14, 2023Updated 2 years ago
- DeeperStacker: DeepHoldem Evil Brother☆37Oct 7, 2020Updated 5 years ago
- ☆30Feb 18, 2026Updated 2 months ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- Common utility functions and algorithms for robotics work used by ARC & ARM labs and TRI. This is a mirror of https://github.com/calderpg…☆13Mar 28, 2026Updated 3 weeks ago
- Clean single-file implementation of offline RL algorithms in JAX☆174Nov 24, 2025Updated 4 months ago
- A collection of utilities for machine learning experiments.☆11Jan 8, 2026Updated 3 months ago