Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).
☆21Jan 15, 2020Updated 6 years ago
Alternatives and similar repositories for stable-opponent-shaping
Users that are interested in stable-opponent-shaping are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Framework for Equilibrium Learning in Sealed-Bid Auctions☆24Mar 17, 2023Updated 3 years ago
- Python wrapper for ACPC poker bot infrastructure☆13May 20, 2018Updated 8 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆34Oct 6, 2022Updated 3 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆150Apr 13, 2023Updated 3 years ago
- Machine Learning Course From Scratch☆13Jul 24, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Minimax Optimization, Stackelberg Games, Generative Adversarial Networks☆19Feb 14, 2020Updated 6 years ago
- Experimenting with kernel density estimation and (soft) histograms using tensorflow data flow graphs.☆10Dec 28, 2017Updated 8 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆98Aug 21, 2018Updated 7 years ago
- Planning Beyond the Sensing Horizon Using a Learned Context☆10Jun 9, 2020Updated 5 years ago
- Bayes-Nash equilibrium computation of combinatorial auctions☆14May 30, 2022Updated 3 years ago
- Official Implementation of Paper "Learning to Jump: Thinning and Thickening Latent Counts for Generative Modeling" (ICML 2023)☆10Jun 6, 2023Updated 2 years ago
- Automatically generate documentation for Python scripts.☆16Dec 21, 2022Updated 3 years ago
- Tilting estimators for program evaluation for Python 3☆10Oct 31, 2019Updated 6 years ago
- Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX.☆21Sep 18, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆15May 15, 2021Updated 5 years ago
- Zeroth-order Min-max Optimization☆13Jun 28, 2020Updated 5 years ago
- An environment for mobile angets to interact with realistic android device or android emulator☆13Jul 19, 2024Updated last year
- Prof. S. Boyd's LaTeX Templates☆13Dec 18, 2018Updated 7 years ago
- Kore 2022 episode visualizer☆10May 29, 2022Updated 4 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆16Jun 28, 2024Updated last year
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆71Apr 15, 2026Updated last month
- ☆10Apr 23, 2021Updated 5 years ago
- ☆10Aug 13, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Reproduce ICLR2018 submission "Emergent Communication through Negotiation"☆17Apr 19, 2018Updated 8 years ago
- ☆24Dec 13, 2018Updated 7 years ago
- Tool to parse and preview Evernote markdown notes.☆24Sep 22, 2016Updated 9 years ago
- Source code for "Influencing Long-Term Behavior in Multiagent Reinforcement Learning" (NeurIPS 2022)☆19Jan 1, 2023Updated 3 years ago
- Implementation of the Off Belief Learning algorithm.☆49Aug 18, 2022Updated 3 years ago
- Code for Continual Learning of Control Primitives☆18Nov 11, 2020Updated 5 years ago
- 适用于解决公司、学校电脑一段时间不使用网络即自动断网,需要网页登录验证问题,基于python3实现,可实时检测电脑网络连接状态,检测到断网后调用谷歌浏览器自动进行网页端登录验证,电脑不关机、本程序处于运行状态中,可实现电脑永不断网。搭配TeamViewer使用可实现无人值守…☆22Feb 15, 2019Updated 7 years ago
- ☆10Aug 8, 2021Updated 4 years ago
- Causal Analysis of Agent Behavior for AI Safety☆20Jun 27, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Work in progress save editor for Monster Hunter: World☆11Aug 15, 2018Updated 7 years ago
- ☆14Aug 9, 2023Updated 2 years ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆35Jun 25, 2025Updated 11 months ago
- A programming language that deduces code from tests☆30Jan 8, 2018Updated 8 years ago
- Code for Policy Consolidation for Continual Reinforcement Learning☆10May 12, 2019Updated 7 years ago
- Unofficial and Partial Implementation of Fast AutoAugment in Pytorch☆10Oct 3, 2023Updated 2 years ago
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Nov 14, 2023Updated 2 years ago