Code for Model-Free Opponent Shaping (ICML 2022)
☆21Nov 18, 2022Updated 3 years ago
Alternatives and similar repositories for model-free-opponent-shaping
Users that are interested in model-free-opponent-shaping are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Scalable Opponent Shaping Experiments in JAX☆25Apr 13, 2024Updated 2 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆33Oct 6, 2022Updated 3 years ago
- Implementation of the Off Belief Learning algorithm.☆49Aug 18, 2022Updated 3 years ago
- a jax benchmark for ad hoc teamwork☆21Updated this week
- PyTorch Implementation of the Sequential Multiagent Rollout algorithm☆11Jun 28, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code release for Learning with Opponent-Learning Awareness and variations.☆151Apr 13, 2023Updated 3 years ago
- Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral.☆241Feb 26, 2026Updated last month
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆49Mar 8, 2024Updated 2 years ago
- Advantage Alignment Algorithms (ICLR 2025 oral)☆19Apr 7, 2025Updated last year
- POPGym Library in JAX☆13Apr 15, 2024Updated 2 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆25Aug 4, 2022Updated 3 years ago
- Source code of "Variational Imitation Learning with Diverse-quality Demonstrations" in ICML 2020. This github repository includes python …☆20Aug 16, 2021Updated 4 years ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆51Jun 26, 2024Updated last year
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆98Aug 21, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆47May 21, 2024Updated last year
- ☆16Jul 16, 2024Updated last year
- Bayesian Optimization Meets Bayesian Optimal Stopping☆32Oct 24, 2020Updated 5 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Jun 15, 2023Updated 2 years ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆71Aug 18, 2016Updated 9 years ago
- Code for the paper "Learning to Do or Learning While Doing: Reinforcement Learning and Bayesian Optimisation for Online Continuous Tuning…☆13Nov 15, 2023Updated 2 years ago
- Code for magnetic mirror descent.☆18Oct 5, 2023Updated 2 years ago
- Implementations of Temporal Difference InfoNCE (TD InfoNCE)☆34Nov 13, 2023Updated 2 years ago
- ☆15Sep 22, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research☆15May 30, 2024Updated last year
- An Open-Ended Agentic Simulator☆60Aug 11, 2024Updated last year
- ☆10Apr 13, 2023Updated 3 years ago
- NiceWebRL is a Python library for quickly making human subject experiments that leverage machine reinforcement learning environments.☆82Jan 14, 2026Updated 3 months ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆34Dec 14, 2023Updated 2 years ago
- Asymmetric methods for partially observable reinforcement learning☆10Jun 9, 2025Updated 10 months ago
- Highly scalable 2D JAX physics engine.☆65Feb 20, 2026Updated last month
- This is the code repository for the paper "Zero-Sum Stochastic Stackelberg Games".☆17Oct 12, 2022Updated 3 years ago
- Code for the paper Normalizing Flows are Capable Models for RL☆19Jun 3, 2025Updated 10 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Octax: Accelerated CHIP-8 Arcade Environments for JAX☆40Feb 18, 2026Updated 2 months ago
- Predictive Coding for Locally-Linear Control (ICML-2020)☆17Jul 22, 2024Updated last year
- Risk-sensitive Inverse Reinforcement Learning☆11Sep 11, 2019Updated 6 years ago
- ☆12Apr 25, 2022Updated 3 years ago
- Drop-in environment replacements that make your RL algorithm train faster.☆22Jun 19, 2024Updated last year
- JAX/Haiku implementation of "Auction Learning as a Two-Player Game"☆11Jul 6, 2024Updated last year
- Your favourite classical machine learning algos on the GPU/TPU☆22Dec 14, 2025Updated 4 months ago