HumanCompatibleAI / human_aware_rlLinks

Code for "On the Utility of Learning about Humans for Human-AI Coordination"

☆109

Alternatives and similar repositories for human_aware_rl

Users that are interested in human_aware_rl are comparing it to the libraries listed below

Sorting:

Stanford-ILIAD / PantheonRL
PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tu…
☆152Updated last year
TonghanWang / ROMA
Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)
☆160Updated 2 years ago
Farama-Foundation / D4RL-Evaluations
☆199Updated 2 years ago
tianheyu927 / mopo
Code for MOPO: Model-based Offline Policy Optimization
☆182Updated 3 years ago
ruizhaogit / maximum_entropy_population_based_training
Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination
☆28Updated 2 years ago
facebookresearch / hanabi_SAD
Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning
☆102Updated 3 years ago
kandouss / marlgrid
Gridworld for MARL experiments
☆141Updated 4 years ago
facebookresearch / deep_bisim4control
Learning Invariant Representations for Reinforcement Learning without Reconstruction
☆149Updated 3 years ago
rraileanu / idaac
☆54Updated last year
011235813 / lio
Learning to Incentivize Other Learning Agents
☆34Updated 3 years ago
lweitkamp / option-critic-pytorch
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
☆130Updated last year
YuhangSong / Arena-Baselines
Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.
☆103Updated 5 months ago
rosewang2008 / gym-cooking
🏆 gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Compu…
☆208Updated 4 years ago
mila-iqia / spr
Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"
☆161Updated 3 years ago
polixir / NeoRL
Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets
☆124Updated 8 months ago
flowersteam / TeachMyAgent
TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.
☆76Updated last year
TonghanWang / NDQ
Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)
☆81Updated 2 years ago
uoe-agents / LIAM
Official Repository for "Agent Modelling under Partial Observability for Deep Reinforcement Learning"
☆36Updated 2 years ago
YuhangSong / Arena-BuildingToolkit
Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.
☆84Updated 4 years ago
mengf1 / CHER
Curriculum-guided Hindsight Experience Replay (NeurIPS-2019)
☆65Updated 5 years ago
jesbu1 / hidio
Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options
☆46Updated 3 years ago
lmzintgraf / varibad
Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)
☆192Updated 2 years ago
rll-research / BPref
Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.
☆129Updated 3 years ago
Xingyu-Lin / mbpo_pytorch
A pytorch reprelication of the model-based reinforcement learning algorithm MBPO
☆175Updated 3 years ago
jcwleo / curiosity-driven-exploration-pytorch
Curiosity-driven Exploration by Self-supervised Prediction
☆139Updated 2 years ago
denisyarats / dmc2gym
OpenAI Gym wrapper for the DeepMind Control Suite
☆219Updated last year
jparkerholder / DvD_ES
Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…
☆44Updated 4 years ago
spitis / mrl
☆113Updated 2 years ago
wendelinboehmer / dcg
☆75Updated last year
microsoft / oac-explore
Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)
☆69Updated last year