Stanford-ILIAD/Conventions-ModularPolicy

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Stanford-ILIAD/Conventions-ModularPolicy)

Stanford-ILIAD / Conventions-ModularPolicy

PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021

☆15

Alternatives and similar repositories for Conventions-ModularPolicy

Users that are interested in Conventions-ModularPolicy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

HumanCompatibleAI / human_aware_rl
View on GitHub
Code for "On the Utility of Learning about Humans for Human-AI Coordination"
☆112Apr 17, 2023Updated 3 years ago
renweiya / RFQ-RFAC
View on GitHub
Represented Value Function Approach for Large Scale Multi Agent Reinforcement Learning
☆17Mar 11, 2020Updated 6 years ago
kennyderek / adap
View on GitHub
Adaptable Agent Populations via a Generative Model of Policies
☆12Oct 14, 2021Updated 4 years ago
LxzGordon / PECAN
View on GitHub
☆12Jan 4, 2024Updated 2 years ago
sjtu-marl / bd_rd_psro
View on GitHub
Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games
☆24Feb 27, 2022Updated 4 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
facebookresearch / hanabi_SAD
View on GitHub
Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning
☆103Jun 22, 2022Updated 4 years ago
AndyShih12 / mac
View on GitHub
PyTorch implementation for "Training and Inference on Any-Order Autoregressive Models the Right Way", NeurIPS 2022 Oral, TPM 2023 Best Pa…
☆16May 31, 2023Updated 3 years ago
jparkerholder / DvD_ES
View on GitHub
Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…
☆46Oct 29, 2020Updated 5 years ago
Amanda2024 / CARE-SMAC-MA_SAC
View on GitHub
Multi-task Multi-agent Soft Actor Critic for SMAC
☆15Jan 18, 2022Updated 4 years ago
rll-research / finetune-vs-metarl
View on GitHub
☆14May 31, 2022Updated 4 years ago
StephAO / HAHA
View on GitHub
Agents to play overcooked ai
☆15Jul 3, 2024Updated 2 years ago
facebookresearch / off-belief-learning
View on GitHub
Implementation of the Off Belief Learning algorithm.
☆49Aug 18, 2022Updated 3 years ago
ermongroup / SPN_Variational_Inference
View on GitHub
PyTorch implementation for "Probabilistic Circuits for Variational Inference in Discrete Graphical Models", NeurIPS 2020
☆17Oct 11, 2021Updated 4 years ago
HumanCompatibleAI / overcooked-hAI-exp
View on GitHub
Overcooked-AI Experiment Psiturk Demo (for MTurk experiments)
☆13May 10, 2021Updated 5 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
ademiadeniji / irm
View on GitHub
Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022)
☆42Jan 13, 2024Updated 2 years ago
chandar-lab / Lifelong-Hanabi
View on GitHub
A Continual Multi-agent RL testbed based on Hanabi
☆31Aug 1, 2021Updated 4 years ago
pygame-web / archives
View on GitHub
archived prebuilts
☆14Jul 30, 2025Updated 11 months ago
yuqingd / cusp
View on GitHub
☆15Sep 7, 2022Updated 3 years ago
stacyste / TheoryOfMindInferenceModels
View on GitHub
☆28Nov 22, 2019Updated 6 years ago
HumanCompatibleAI / overcooked-demo
View on GitHub
Web application where humans can play Overcooked with AI agents.
☆60Dec 6, 2022Updated 3 years ago
j96w / cogail
View on GitHub
Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration
☆52Nov 8, 2021Updated 4 years ago
rll-research / teachable
View on GitHub
☆17Oct 12, 2023Updated 2 years ago
alexzhou907 / DreamPropeller
View on GitHub
☆89Apr 18, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
jparkerholder / PB2
View on GitHub
Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.
☆20Apr 13, 2021Updated 5 years ago
ludc / gymecs
View on GitHub
☆25Nov 1, 2022Updated 3 years ago
ahmed-touati / controllable_agent
View on GitHub
☆61Jun 6, 2023Updated 3 years ago
lamda-bbo / madac
View on GitHub
Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”
☆26Mar 6, 2023Updated 3 years ago
aronsar / hoad
View on GitHub
☆14Jun 17, 2022Updated 4 years ago
fishmoon1234 / DAG-NoCurl
View on GitHub
☆28Dec 20, 2021Updated 4 years ago
njustesen / a2c_gvgai
View on GitHub
A2C for GVG-AI
☆22Nov 7, 2018Updated 7 years ago
gcucurull / jax-gat
View on GitHub
JAX implementation of Graph Attention Networks
☆13Jan 29, 2022Updated 4 years ago
microsoft / segar
View on GitHub
Sandbox environment for generalizable agent research
☆27Aug 19, 2022Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
deligentfool / SIDE
View on GitHub
Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"
☆11Jun 24, 2022Updated 4 years ago
xjtushujun / Auto-6ML
View on GitHub
Auto^6ML is a jittor library allowing users to achieve machine learning automation.
☆26Sep 28, 2024Updated last year
awjuliani / successor_examples
View on GitHub
Tutorials on learning and using successor representations.
☆54Oct 31, 2019Updated 6 years ago
GIS-PuppetMaster / Auto-STGCN
View on GitHub
source code of paper 'Auto-STGCN: Autonomous Spatial-Temporal Graph Convolutional Network Search Based on Reinforcement Learning and Exis…
☆11Jan 26, 2021Updated 5 years ago
YuhangSong / Arena-Baselines
View on GitHub
Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.
☆103Mar 6, 2025Updated last year
entity-neural-network / entity-gym
View on GitHub
Standard interface for entity based reinforcement learning environments.
☆39Feb 28, 2024Updated 2 years ago
amilner42 / interview-practice
View on GitHub
Questions and solutions (in Java) for technical CS interview problems
☆22Oct 9, 2015Updated 10 years ago