aletcher/stable-opponent-shaping

PyTorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).

☆21

Alternatives and similar repositories for stable-opponent-shaping

Users who are interested in stable-opponent-shaping compare it to the repositories listed below.

JakubPetriska / acpc-python-client
Python wrapper for ACPC poker bot infrastructure
☆13 | May 20, 2018 | Updated 8 years ago
google-deepmind / symplectic-gradient-adjustment
A colab that implements the Symplectic Gradient Adjustment optimizer from "The mechanics of n-player differentiable games"
☆154 | Dec 6, 2018 | Updated 7 years ago
ucl-dark / pax
Scalable Opponent Shaping Experiments in JAX
☆27 | Apr 13, 2024 | Updated 2 years ago
alshedivat / lola
Code release for Learning with Opponent-Learning Awareness and variations.
☆152 | Apr 13, 2023 | Updated 3 years ago
yexf308 / MachineLearning
Machine Learning Course From Scratch
☆13 | Jul 24, 2024 | Updated 2 years ago
bryangraham / ipt
Tilting estimators for program evaluation for Python 3
☆10 | Oct 31, 2019 | Updated 6 years ago
pablodecm / kernelflow
Experimenting with kernel density estimation and (soft) histograms using tensorflow data flow graphs.
☆10 | Dec 28, 2017 | Updated 8 years ago
mit-acl / dc2g
Planning Beyond the Sensing Horizon Using a Learned Context
☆10 | Jun 9, 2020 | Updated 6 years ago
monika76five / Probability
☆10 | Aug 13, 2021 | Updated 4 years ago
marketdesignresearch / CA-BNE
Bayes-Nash equilibrium computation of combinatorial auctions
☆14 | May 30, 2022 | Updated 4 years ago
amkatrutsa / advanced-opt
Presentations of the advanced topics in optimization
☆11 | Oct 30, 2019 | Updated 6 years ago
flowersteam / EAGER
☆10 | Oct 11, 2022 | Updated 3 years ago
michaelsyao / Machine-Learning-and-Reinforcement-Learning-in-Finance
Machine Learning and Reinforcement Learning in Finance New York University Tandon School of Engineering
☆13 | Nov 8, 2018 | Updated 7 years ago
iassael / torch-ddcnn
From Pixels to Torques: Policy Learning using Deep Dynamical Convolutional Neural Networks (DDCNN)
☆42 | Nov 3, 2016 | Updated 9 years ago
jaekyeom / drop-bottleneck
☆16 | May 15, 2021 | Updated 5 years ago
gcucurull / maml_flax
Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX.
☆21 | Sep 18, 2020 | Updated 5 years ago
info-structures / ais
This repository contains the code for RL for POMDPs through learning an Approximate Information State.
☆23 | Nov 29, 2025 | Updated 8 months ago
KaidiXu / ZO-minmax
Zeroth-order Min-max Optimization
☆13 | Jun 28, 2020 | Updated 6 years ago
ysc3839 / Mi6-MIPay-Systemless
Bring MI Pay to MIUI Global.
☆15 | Nov 26, 2019 | Updated 6 years ago
michaelchanwahyan / latex_templates
Prof. S. Boyd's LaTeX Templates
☆13 | Dec 18, 2018 | Updated 7 years ago
LlamaTouch / AgentEnv
An environment for mobile agents to interact with a realistic Android device or Android emulator
☆13 | Jul 19, 2024 | Updated 2 years ago
hhexiy / opponent
Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"
☆71 | Apr 15, 2026 | Updated 3 months ago
brandinho / Poker-Probability-Approximation
☆24 | Dec 13, 2018 | Updated 7 years ago
asappresearch / emergent-comms-negotiation
Reproduce ICLR2018 submission "Emergent Communication through Negotiation"
☆17 | Apr 19, 2018 | Updated 8 years ago
regedor / Evermark
Tool to parse and preview Evernote markdown notes.
☆24 | Sep 22, 2016 | Updated 9 years ago
npvoid / OnlineDoubleOracle
☆10 | Apr 23, 2021 | Updated 5 years ago
sdpa-python / sdpa-python
SemiDefinite Programming Algorithm (SDPA) for Python
☆12 | Jul 1, 2026 | Updated 3 weeks ago
moptimization / pythondice2013implementation
This is a port of the 2013 DICE model by William Nordhaus from GAMS to Python using Pyomo.
☆13 | Apr 16, 2016 | Updated 10 years ago
mit-aera / OptiTrack-Motive-2-Client
ROS and LCM drivers for OptiTrack's Motive 2 software. Optimized for tracking aerial drones. Runs on Ubuntu Linux.
☆21 | Jul 28, 2020 | Updated 6 years ago
andrewschreiber / agent
Interpretability dashboard for reinforcement learners
☆16 | Jun 4, 2019 | Updated 7 years ago
mind-palace-laeqa / benchmark
☆18 | Oct 31, 2025 | Updated 8 months ago
matrl-project / matrl
☆12 | Jan 30, 2021 | Updated 5 years ago
YiteWang / lemon-pytorch
This is the unofficial implementation of LEMON (ICLR'2024).
☆13 | Apr 14, 2024 | Updated 2 years ago
DHDev0 / Muzero-unplugged
PyTorch implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…
☆36 | Jun 25, 2025 | Updated last year
ZishunYu / Actor-Critic-Alignment
Implementation of "Actor-Critic Alignment for Offline-to-Online Reinforcement Learning"
☆13 | Oct 12, 2023 | Updated 2 years ago
facebookresearch / off-belief-learning
Implementation of the Off Belief Learning algorithm.
☆49 | Aug 18, 2022 | Updated 3 years ago
anruoran / Network-State-Detection-and-AutoLogin
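Addresses the problem of company or school computers being disconnected automatically after a period of network inactivity and then requiring web login verification. Implemented in Python 3, it monitors the computer's network connection in real time and, on detecting a disconnect, launches Google Chrome to complete the web login automatically. As long as the computer stays on and this program keeps running, the computer never loses its connection. Used together with TeamViewer, it enables unattended…
☆22 | Feb 15, 2019 | Updated 7 years ago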
vsyrgkanis / adversarial_gmm
Prototype code for paper: Adversarial Generalized Method of Moments, Greg Lewis and Vasilis Syrgkanis
☆13 | Oct 21, 2020 | Updated 5 years ago
Nexusphobiker / MHWSaveEditor
Work in progress save editor for Monster Hunter: World
☆11 | Aug 15, 2018 | Updated 7 years ago