HumanCompatibleAI / human_aware_rl
Code for "On the Utility of Learning about Humans for Human-AI Coordination"
☆110Updated 2 years ago
Alternatives and similar repositories for human_aware_rl
Users that are interested in human_aware_rl are comparing it to the libraries listed below
- ☆202Updated 2 years ago
- PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tu…☆156Updated 2 years ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆155Updated 4 years ago
- Code accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)☆168Updated 3 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆145Updated 2 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆191Updated 3 years ago
- Learning to Incentivize Other Learning Agents☆35Updated 3 years ago
- Gridworld for MARL experiments☆144Updated 4 years ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆162Updated 4 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Updated 3 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆141Updated last year
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆103Updated 3 years ago
- ☆134Updated last year
- ☆78Updated last year
- ☆54Updated last year
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)☆198Updated 2 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO☆183Updated 3 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆226Updated last year
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas.☆97Updated 7 months ago
- 🏆 gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Compu…☆216Updated 4 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆92Updated 4 years ago
- Code accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆86Updated 3 years ago
- ☆116Updated 2 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆137Updated 4 months ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Updated 3 years ago
- Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning"; contains scripts to reproduce experiments.☆133Updated 4 years ago
- Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets☆130Updated last year
- Submission for MAVEN: Multi-Agent Variational Exploration☆59Updated 3 years ago
- Soft Actor-Critic☆157Updated 7 years ago
- ☆114Updated 2 years ago