Implementation of the Off Belief Learning algorithm.
☆49Aug 18, 2022Updated 3 years ago
Alternatives and similar repositories for off-belief-learning
Users that are interested in off-belief-learning are comparing it to the libraries listed below
Sorting:
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆102Jun 22, 2022Updated 3 years ago
- ☆11Apr 23, 2021Updated 4 years ago
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆24Feb 27, 2022Updated 4 years ago
- Code for "Joint Policy Search for Collaborative Multi-agent Incomplete Information Games"☆52Nov 14, 2023Updated 2 years ago
- ☆16Feb 23, 2024Updated 2 years ago
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021☆15Mar 9, 2021Updated 4 years ago
- ☆14Jun 17, 2022Updated 3 years ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆28Nov 19, 2021Updated 4 years ago
- Reinforcement Learning Assembly☆92Sep 2, 2021Updated 4 years ago
- Code for Model-Free Opponent Shaping (ICML 2022)☆20Nov 18, 2022Updated 3 years ago
- Scalable Opponent Shaping Experiments in JAX☆25Apr 13, 2024Updated last year
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆25Aug 4, 2022Updated 3 years ago
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”☆26Mar 6, 2023Updated 2 years ago
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆14Apr 25, 2024Updated last year
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Nov 14, 2023Updated 2 years ago
- ☆13Oct 11, 2022Updated 3 years ago
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it☆129Jul 18, 2023Updated 2 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- A Continual Multi-agent RL testbed based on Hanabi☆32Aug 1, 2021Updated 4 years ago
- using information theory to encourage agents to cooperate and compete☆19Oct 4, 2018Updated 7 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆46Oct 29, 2020Updated 5 years ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆109Apr 17, 2023Updated 2 years ago
- Official Repository for "Agent Modelling under Partial Observability for Deep Reinforcement Learning"☆41Oct 5, 2022Updated 3 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆18Mar 2, 2021Updated 5 years ago
- ☆22May 20, 2021Updated 4 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆33Oct 6, 2022Updated 3 years ago
- ☆61Apr 22, 2024Updated last year
- Official PyTorch implementation of "EvoGrad: Efficient Gradient-Based Meta-Learning and Hyperparameter Optimization"☆23Oct 24, 2021Updated 4 years ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆72Aug 18, 2016Updated 9 years ago
- Code for magnetic mirror descent.☆17Oct 5, 2023Updated 2 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Dec 8, 2022Updated 3 years ago
- ☆10Feb 28, 2019Updated 7 years ago
- ☆10Mar 10, 2021Updated 4 years ago
- Remote sensing Image Captioning is a special case of Image Captioning which solves the difficulties in processing the remote sensing imag…☆11Jun 16, 2021Updated 4 years ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆67May 22, 2021Updated 4 years ago
- ☆12Jan 30, 2021Updated 5 years ago
- hsvgbkhgbv / Thermostat-assisted-continuously-tempered-Hamiltonian-Monte-Carlo-for-Bayesian-learningThermostat-assisted continuously-tempered Hamiltonian Monte Carlo for Bayesian learning☆10Dec 10, 2018Updated 7 years ago
- Resilient Multi-Agent Reinforcement Learning☆10Nov 4, 2022Updated 3 years ago
- Codebase for BRDiv: Diverse teammate generation for ad hoc teamwork☆13May 2, 2024Updated last year