Code for Model-Free Opponent Shaping (ICML 2022)
☆20Nov 18, 2022Updated 3 years ago
Alternatives and similar repositories for model-free-opponent-shaping
Users that are interested in model-free-opponent-shaping are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Scalable Opponent Shaping Experiments in JAX☆25Apr 13, 2024Updated last year
- Implementation of the Off Belief Learning algorithm.☆49Aug 18, 2022Updated 3 years ago
- a jax benchmark for ad hoc teamwork☆21Updated this week
- PyTorch Implementation of the Sequential Multiagent Rollout algorithm☆11Jun 28, 2024Updated last year
- Code release for Learning with Opponent-Learning Awareness and variations.☆152Apr 13, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Advantage Alignment Algorithms (ICLR 2025 oral)☆17Apr 7, 2025Updated 11 months ago
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆49Mar 8, 2024Updated 2 years ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆20Aug 26, 2022Updated 3 years ago
- POPGym Library in JAX☆12Apr 15, 2024Updated last year
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆25Aug 4, 2022Updated 3 years ago
- Efficient baselines for autocurricula in JAX.☆211Aug 24, 2024Updated last year
- Source code of "Variational Imitation Learning with Diverse-quality Demonstrations" in ICML 2020. This github repository includes python …☆20Aug 16, 2021Updated 4 years ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆51Jun 26, 2024Updated last year
- ☆47May 21, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Cost-aware Bayesian optimization via the Pandora's box Gittins index☆14Aug 8, 2025Updated 7 months ago
- ☆16Jul 16, 2024Updated last year
- Bayesian Optimization Meets Bayesian Optimal Stopping☆32Oct 24, 2020Updated 5 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Jun 15, 2023Updated 2 years ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆72Aug 18, 2016Updated 9 years ago
- Code for the paper "Learning to Do or Learning While Doing: Reinforcement Learning and Bayesian Optimisation for Online Continuous Tuning…☆13Nov 15, 2023Updated 2 years ago
- Code for magnetic mirror descent.☆18Oct 5, 2023Updated 2 years ago
- Baselines for gymnax 🤖☆75Apr 3, 2023Updated 2 years ago
- Implementations of Temporal Difference InfoNCE (TD InfoNCE)☆34Nov 13, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆15Sep 22, 2023Updated 2 years ago
- Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research☆15May 30, 2024Updated last year
- ☆10Apr 13, 2023Updated 2 years ago
- NiceWebRL is a Python library for quickly making human subject experiments that leverage machine reinforcement learning environments.☆81Jan 14, 2026Updated 2 months ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆34Dec 14, 2023Updated 2 years ago
- Highly scalable 2D JAX physics engine.☆64Feb 20, 2026Updated last month
- Asymmetric methods for partially observable reinforcement learning☆10Jun 9, 2025Updated 9 months ago
- This is the code repository for the paper "Zero-Sum Stochastic Stackelberg Games".☆17Oct 12, 2022Updated 3 years ago
- Octax: Accelerated CHIP-8 Arcade Environments for JAX☆40Feb 18, 2026Updated last month
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Code for the paper Normalizing Flows are Capable Models for RL☆19Jun 3, 2025Updated 9 months ago
- Risk-sensitive Inverse Reinforcement Learning☆11Sep 11, 2019Updated 6 years ago
- ☆12Apr 25, 2022Updated 3 years ago
- Official Repository for "Agent Modelling under Partial Observability for Deep Reinforcement Learning"☆42Oct 5, 2022Updated 3 years ago
- Literature and code for inverse reinforcement leanring research☆29Mar 6, 2020Updated 6 years ago
- A tool for aggregating and plotting MARL experiment data.☆84Jan 26, 2026Updated 2 months ago
- Drop-in environment replacements that make your RL algorithm train faster.☆21Jun 19, 2024Updated last year