Code for Model-Free Opponent Shaping (ICML 2022)
☆23Nov 18, 2022Updated 3 years ago
Alternatives and similar repositories for model-free-opponent-shaping
Users that are interested in model-free-opponent-shaping are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Scalable Opponent Shaping Experiments in JAX☆26Apr 13, 2024Updated 2 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆34Oct 6, 2022Updated 3 years ago
- a jax benchmark for ad hoc teamwork☆22Updated this week
- PyTorch Implementation of the Sequential Multiagent Rollout algorithm☆11Jun 28, 2024Updated last year
- Code release for Learning with Opponent-Learning Awareness and variations.☆150Apr 13, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral.☆253May 21, 2026Updated last week
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆21Aug 26, 2022Updated 3 years ago
- Advantage Alignment Algorithms (ICLR 2025 oral)☆19Apr 7, 2025Updated last year
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆25Aug 4, 2022Updated 3 years ago
- Efficient baselines for autocurricula in JAX.☆213Aug 24, 2024Updated last year
- Source code of "Variational Imitation Learning with Diverse-quality Demonstrations" in ICML 2020. This github repository includes python …☆20Aug 16, 2021Updated 4 years ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆50Jun 26, 2024Updated last year
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆98Aug 21, 2018Updated 7 years ago
- ☆47May 21, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Cost-aware Bayesian optimization via the Pandora's box Gittins index☆14Aug 8, 2025Updated 9 months ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Jun 15, 2023Updated 2 years ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆71Apr 15, 2026Updated last month
- Code for the paper "Learning to Do or Learning While Doing: Reinforcement Learning and Bayesian Optimisation for Online Continuous Tuning…☆13Nov 15, 2023Updated 2 years ago
- Code for magnetic mirror descent.☆18Oct 5, 2023Updated 2 years ago
- Baselines for gymnax 🤖☆76Apr 3, 2023Updated 3 years ago
- ☆15Sep 22, 2023Updated 2 years ago
- Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research☆15May 30, 2024Updated last year
- An Open-Ended Agentic Simulator☆60Aug 11, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆10Apr 13, 2023Updated 3 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆34Dec 14, 2023Updated 2 years ago
- Asymmetric methods for partially observable reinforcement learning☆10Jun 9, 2025Updated 11 months ago
- Highly scalable 2D JAX physics engine.☆67Apr 20, 2026Updated last month
- 论文Reinforcement Learning of Sequential Price Mechanisms的复现☆12Nov 3, 2022Updated 3 years ago
- This is the code repository for the paper "Zero-Sum Stochastic Stackelberg Games".☆17Oct 12, 2022Updated 3 years ago
- Octax: Accelerated CHIP-8 Arcade Environments for JAX☆54Apr 20, 2026Updated last month
- Code for the paper Normalizing Flows are Capable Models for RL☆19Jun 3, 2025Updated 11 months ago
- Predictive Coding for Locally-Linear Control (ICML-2020)☆18Jul 22, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Risk-sensitive Inverse Reinforcement Learning☆11Sep 11, 2019Updated 6 years ago
- ☆12Apr 25, 2022Updated 4 years ago
- Official Repository for "Agent Modelling under Partial Observability for Deep Reinforcement Learning"☆43Oct 5, 2022Updated 3 years ago
- A tool for aggregating and plotting MARL experiment data.☆84Apr 13, 2026Updated last month
- Drop-in environment replacements that make your RL algorithm train faster.☆22Jun 19, 2024Updated last year
- Exploitability calculation for imperfect-information game benchmarks☆34Apr 5, 2025Updated last year
- JAX/Haiku implementation of "Auction Learning as a Two-Player Game"☆11Jul 6, 2024Updated last year