Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).
☆21Jan 15, 2020Updated 6 years ago
Alternatives and similar repositories for stable-opponent-shaping
Users that are interested in stable-opponent-shaping are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Framework for Equilibrium Learning in Sealed-Bid Auctions☆24Mar 17, 2023Updated 3 years ago
- Python wrapper for ACPC poker bot infrastructure☆13May 20, 2018Updated 7 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆33Oct 6, 2022Updated 3 years ago
- A colab that implements the Symplectic Gradient Adjustment optimizer from "The mechanics of n-player differentiable games"☆153Dec 6, 2018Updated 7 years ago
- Scalable Opponent Shaping Experiments in JAX☆25Apr 13, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code release for Learning with Opponent-Learning Awareness and variations.☆152Apr 13, 2023Updated 2 years ago
- Minimax Optimization, Stackelberg Games, Generative Adversarial Networks☆19Feb 14, 2020Updated 6 years ago
- Experimenting with kernel density estimation and (soft) histograms using tensorflow data flow graphs.☆10Dec 28, 2017Updated 8 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆97Aug 21, 2018Updated 7 years ago
- Presentations of the advanced topics in optimization☆11Oct 30, 2019Updated 6 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- From Pixels to Torques: Policy Learning using Deep Dynamical Convolutional Neural Networks (DDCNN)☆42Nov 3, 2016Updated 9 years ago
- Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX.☆21Sep 18, 2020Updated 5 years ago
- ☆15May 15, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Zeroth-order Min-max Optimization☆13Jun 28, 2020Updated 5 years ago
- An environment for mobile angets to interact with realistic android device or android emulator☆13Jul 19, 2024Updated last year
- Prof. S. Boyd's LaTeX Templates☆13Dec 18, 2018Updated 7 years ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆72Aug 18, 2016Updated 9 years ago
- ☆10Apr 23, 2021Updated 4 years ago
- Reproduce ICLR2018 submission "Emergent Communication through Negotiation"☆17Apr 19, 2018Updated 7 years ago
- ☆24Dec 13, 2018Updated 7 years ago
- ☆12Jan 30, 2021Updated 5 years ago
- Source code for "Influencing Long-Term Behavior in Multiagent Reinforcement Learning" (NeurIPS 2022)☆19Jan 1, 2023Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Action Recognition using Convolutional Neural Network (CNN)☆13Jun 10, 2018Updated 7 years ago
- ROS and LCM drivers for OptiTrack's Motive 2 software. Optimized for tracking aerial drones. Runs on Ubuntu Linux.☆20Jul 28, 2020Updated 5 years ago
- This is the unofficial implementation of LEMON (ICLR'2024).☆12Apr 14, 2024Updated last year
- Implementation of the Off Belief Learning algorithm.☆49Aug 18, 2022Updated 3 years ago
- Code for Continual Learning of Control Primitives☆18Nov 11, 2020Updated 5 years ago
- Implementation of ``Actor-Critic Alignment for Offline-to-Online Reinforcement Learning''☆13Oct 12, 2023Updated 2 years ago
- Interpretability dashboard for reinforcement learners☆16Jun 4, 2019Updated 6 years ago
- ☆10Aug 8, 2021Updated 4 years ago
- Causal Analysis of Agent Behavior for AI Safety☆20Jun 27, 2023Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Work in progress save editor for Monster Hunter: World☆11Aug 15, 2018Updated 7 years ago
- A fast data loader for ImageNet on PyTorch.☆18Mar 17, 2019Updated 7 years ago
- A programming language that deduces code from tests☆30Jan 8, 2018Updated 8 years ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆35Jun 25, 2025Updated 9 months ago
- Codification used for the AAMAS-17 paper "Simultaneously Learning and Advising in Multiagent Reinforcement Learning"☆15Dec 18, 2017Updated 8 years ago
- Code for Policy Consolidation for Continual Reinforcement Learning☆10May 12, 2019Updated 6 years ago
- Unofficial and Partial Implementation of Fast AutoAugment in Pytorch☆10Oct 3, 2023Updated 2 years ago