aronsar / hoadLinks
☆14Updated 3 years ago
Alternatives and similar repositories for hoad
Users that are interested in hoad are comparing it to the libraries listed below
Sorting:
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆103Updated 3 years ago
- Implementation of the Off Belief Learning algorithm.☆49Updated 3 years ago
- ☆54Updated last year
- Overcooked-AI Experiment Psiturk Demo (for MTurk experiments)☆12Updated 4 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Updated 3 years ago
- Code for "Learning to Reach Goals via Iterated Supervised Learning"☆83Updated 3 years ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆110Updated 2 years ago
- ☆115Updated 2 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆30Updated 6 years ago
- ☆13Updated 2 years ago
- Soft Actor-Critic☆157Updated 7 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Updated 4 years ago
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆47Updated 4 years ago
- DMControl Generalization Benchmark☆187Updated 2 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆55Updated 4 years ago
- ☆53Updated 3 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆84Updated 4 years ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆163Updated 4 years ago
- Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google)☆126Updated last year
- ☆40Updated 4 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆70Updated 3 years ago
- The Implementation of "Machine Theory of Mind", ICML 2018☆26Updated 3 years ago
- Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills"☆34Updated 5 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆226Updated last year
- Code for Model-Free Opponent Shaping (ICML 2022)☆20Updated 3 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Updated 3 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆44Updated 5 years ago
- A list of papers regarding generalization in (deep) reinforcement learning☆153Updated 2 years ago
- Code for NeurIPS 2021 paper "Offline Reinforcement Learning with Reverse Model-based Imagination"☆19Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated 2 years ago