PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021
☆15Mar 9, 2021Updated 4 years ago
Alternatives and similar repositories for Conventions-ModularPolicy
Users that are interested in Conventions-ModularPolicy are comparing it to the libraries listed below
Sorting:
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆24Feb 27, 2022Updated 4 years ago
- ☆12Jan 4, 2024Updated 2 years ago
- Implementation of the Off Belief Learning algorithm.☆49Aug 18, 2022Updated 3 years ago
- Adaptable Agent Populations via a Generative Model of Policies☆12Oct 14, 2021Updated 4 years ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆109Apr 17, 2023Updated 2 years ago
- A Continual Multi-agent RL testbed based on Hanabi☆32Aug 1, 2021Updated 4 years ago
- The code for experiments conducted to verify the correctness of mirror learning.☆11Jun 3, 2022Updated 3 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆46Oct 29, 2020Updated 5 years ago
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆14Apr 25, 2024Updated last year
- ☆15Sep 7, 2022Updated 3 years ago
- Overcooked-AI Experiment Psiturk Demo (for MTurk experiments)☆12May 10, 2021Updated 4 years ago
- Represented Value Function Approach for Large Scale Multi Agent Reinforcement Learning☆16Mar 11, 2020Updated 5 years ago
- ☆14May 31, 2022Updated 3 years ago
- Multi-task Multi-agent Soft Actor Critic for SMAC☆15Jan 18, 2022Updated 4 years ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆102Jun 22, 2022Updated 3 years ago
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environment☆17Dec 24, 2017Updated 8 years ago
- Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022)☆42Jan 13, 2024Updated 2 years ago
- ☆14Jun 17, 2022Updated 3 years ago
- ☆23Feb 15, 2022Updated 4 years ago
- ☆22May 20, 2021Updated 4 years ago
- A2C for GVG-AI☆23Nov 7, 2018Updated 7 years ago
- ☆27Dec 20, 2021Updated 4 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆25Aug 4, 2022Updated 3 years ago
- Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.☆20Apr 13, 2021Updated 4 years ago
- Game-based AI Platforms☆27Jun 27, 2024Updated last year
- ☆57Jun 6, 2023Updated 2 years ago
- Tutorials on learning and using successor representations.☆54Oct 31, 2019Updated 6 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆51Dec 8, 2022Updated 3 years ago
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”☆26Mar 6, 2023Updated 2 years ago
- Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration☆53Nov 8, 2021Updated 4 years ago
- curriculum☆27Feb 7, 2023Updated 3 years ago
- ☆28Nov 22, 2019Updated 6 years ago
- Sandbox environment for generalizable agent research☆27Aug 19, 2022Updated 3 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Mar 6, 2025Updated 11 months ago
- RLA is a tool for managing your RL experiments automatically☆32Jan 11, 2025Updated last year
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆28Nov 19, 2021Updated 4 years ago
- A tool to paste Excel ranges to Reddit☆11Sep 20, 2025Updated 5 months ago
- Submission for MAVEN: Multi-Agent Variational Exploration☆59Apr 6, 2022Updated 3 years ago
- Big Data Analysis of Tinder done at Universitat Rovira i Virgili and Universitat Politècnica de Catalunya · BarcelonaTech☆13Jan 3, 2023Updated 3 years ago