luchris429 / model-free-opponent-shapingView external linksLinks
Code for Model-Free Opponent Shaping (ICML 2022)
☆20Nov 18, 2022Updated 3 years ago
Alternatives and similar repositories for model-free-opponent-shaping
Users that are interested in model-free-opponent-shaping are comparing it to the libraries listed below
Sorting:
- Scalable Opponent Shaping Experiments in JAX☆25Apr 13, 2024Updated last year
- a jax benchmark for ad hoc teamwork☆17Updated this week
- Implementation of the Off Belief Learning algorithm.☆49Aug 18, 2022Updated 3 years ago
- PyTorch Implementation of the Sequential Multiagent Rollout algorithm☆11Jun 28, 2024Updated last year
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆33Oct 6, 2022Updated 3 years ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆51Jun 26, 2024Updated last year
- Source code of "Variational Imitation Learning with Diverse-quality Demonstrations" in ICML 2020. This github repository includes python …☆20Aug 16, 2021Updated 4 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆25Aug 4, 2022Updated 3 years ago
- NiceWebRL is a Python library for quickly making human subject experiments that leverage machine reinforcement learning environments.☆79Jan 14, 2026Updated last month
- ☆47May 21, 2024Updated last year
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆49Mar 8, 2024Updated last year
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆20Aug 26, 2022Updated 3 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆151Apr 13, 2023Updated 2 years ago
- Exploitability calculation for imperfect-information game benchmarks☆32Apr 5, 2025Updated 10 months ago
- Efficient baselines for autocurricula in JAX.☆206Aug 24, 2024Updated last year
- Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral.☆229Jan 24, 2026Updated 3 weeks ago
- Cost-aware Bayesian optimization via the Pandora's box Gittins index☆14Aug 8, 2025Updated 6 months ago
- Implementations of Temporal Difference InfoNCE (TD InfoNCE)☆33Nov 13, 2023Updated 2 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆33Dec 14, 2023Updated 2 years ago
- A lightweight driving simulator, written in Julia.☆19Sep 25, 2024Updated last year
- The code used to power DeepRole☆37Nov 21, 2022Updated 3 years ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆72Aug 18, 2016Updated 9 years ago
- Advantage Alignment Algorithms (ICLR 2025 oral)☆16Apr 7, 2025Updated 10 months ago
- (NeurIPS 2025) LaRes: Evolutionary Reinforcement Learning with LLM-based Adaptive Reward Search☆20Feb 3, 2026Updated last week
- Codes related to Lord of the Machines hackathon☆10Apr 25, 2018Updated 7 years ago
- Gymnasium environment for research of UAVs and risk constraints☆12Oct 29, 2024Updated last year
- Predictable Feature Analysis☆10Dec 1, 2014Updated 11 years ago
- Code for the paper "Learning to Do or Learning While Doing: Reinforcement Learning and Bayesian Optimisation for Online Continuous Tuning…☆13Nov 15, 2023Updated 2 years ago
- Risk-sensitive Inverse Reinforcement Learning☆11Sep 11, 2019Updated 6 years ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆40Feb 18, 2025Updated 11 months ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Aug 21, 2018Updated 7 years ago
- [RAL 2025] MTIL: Encoding Full History with Mamba for Temporal Imitation Learning☆27Nov 17, 2025Updated 3 months ago
- Advanced_Data_Integration_Project☆11Jul 31, 2018Updated 7 years ago
- ☆11May 8, 2016Updated 9 years ago
- Source code for ICML 2023 paper "Competing for Shareable Arms in Multi-Player Multi-Armed Bandits"☆10May 14, 2024Updated last year
- Robust Reinforcement Learning Benchmark☆12Sep 22, 2024Updated last year
- Some starter code for training/testing some basic CNN models given our data.☆10Feb 15, 2017Updated 9 years ago
- An ontology of space situational awareness.☆11Mar 23, 2023Updated 2 years ago