Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).
☆21Jan 15, 2020Updated 6 years ago
Alternatives and similar repositories for stable-opponent-shaping
Users that are interested in stable-opponent-shaping are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆34Oct 6, 2022Updated 3 years ago
- A colab that implements the Symplectic Gradient Adjustment optimizer from "The mechanics of n-player differentiable games"☆154Dec 6, 2018Updated 7 years ago
- Scalable Opponent Shaping Experiments in JAX☆26Apr 13, 2024Updated 2 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆150Apr 13, 2023Updated 3 years ago
- Machine Learning Course From Scratch☆13Jul 24, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Experimenting with kernel density estimation and (soft) histograms using tensorflow data flow graphs.☆10Dec 28, 2017Updated 8 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆98Aug 21, 2018Updated 7 years ago
- Presentations of the advanced topics in optimization☆11Oct 30, 2019Updated 6 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- Official Implementation of Paper "Learning to Jump: Thinning and Thickening Latent Counts for Generative Modeling" (ICML 2023)☆10Jun 6, 2023Updated 2 years ago
- Tilting estimators for program evaluation for Python 3☆10Oct 31, 2019Updated 6 years ago
- Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX.☆21Sep 18, 2020Updated 5 years ago
- ☆15May 15, 2021Updated 4 years ago
- This repository contains the code for RL for POMDPs through learning an Approximate Information State.☆23Nov 29, 2025Updated 5 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Bring MI Pay to MIUI Global.☆15Nov 26, 2019Updated 6 years ago
- An environment for mobile angets to interact with realistic android device or android emulator☆13Jul 19, 2024Updated last year
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆71Apr 15, 2026Updated 3 weeks ago
- ☆10Apr 23, 2021Updated 5 years ago
- Reproduce ICLR2018 submission "Emergent Communication through Negotiation"☆17Apr 19, 2018Updated 8 years ago
- ☆24Dec 13, 2018Updated 7 years ago
- ☆12Jan 30, 2021Updated 5 years ago
- Source code for "Influencing Long-Term Behavior in Multiagent Reinforcement Learning" (NeurIPS 2022)☆19Jan 1, 2023Updated 3 years ago
- Action Recognition using Convolutional Neural Network (CNN)☆13Jun 10, 2018Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- This is a port of the 2013 dice model by William Nordhaus from GAMS to python using pyomo.☆12Apr 16, 2016Updated 10 years ago
- This is the unofficial implementation of LEMON (ICLR'2024).☆13Apr 14, 2024Updated 2 years ago
- Implementation of the Off Belief Learning algorithm.☆49Aug 18, 2022Updated 3 years ago
- Code for Continual Learning of Control Primitives☆18Nov 11, 2020Updated 5 years ago
- Implementation of ``Actor-Critic Alignment for Offline-to-Online Reinforcement Learning''☆13Oct 12, 2023Updated 2 years ago
- Interpretability dashboard for reinforcement learners☆16Jun 4, 2019Updated 6 years ago
- 适用于解决公司、学校电脑一段时间不使用网络即自动断网,需要网页登录验证问题,基于python3实现,可实时检测电脑网络连接状态,检测到断网后调用谷歌浏览器自动进行网页端登录验证,电脑不关机、本程序处于运行状态中,可实现电脑永不断网。搭配TeamViewer使用可实现无人值守…☆22Feb 15, 2019Updated 7 years ago
- ☆10Aug 8, 2021Updated 4 years ago
- Myriad is a real-world testbed that aims to bridge trajectory optimization and deep learning.☆68Sep 12, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Code for Policy Consolidation for Continual Reinforcement Learning☆10May 12, 2019Updated 6 years ago
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Nov 14, 2023Updated 2 years ago
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆43Dec 8, 2022Updated 3 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- Common utility functions and algorithms for robotics work used by ARC & ARM labs and TRI. This is a mirror of https://github.com/calderpg…☆13Mar 28, 2026Updated last month
- ☆22Dec 8, 2022Updated 3 years ago
- This repository contains the replication of the iGSM dataset generation process from the Physics of LLM paper by Zeyuan Zhu.☆17Sep 13, 2024Updated last year