facebookresearch / hanabi_SADLinks
Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning
☆102Updated 3 years ago
Alternatives and similar repositories for hanabi_SAD
Users that are interested in hanabi_SAD are comparing it to the libraries listed below
Sorting:
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆69Updated 2 years ago
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas.☆97Updated 5 months ago
- Curiosity-driven Exploration by Self-supervised Prediction☆144Updated 2 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction☆162Updated 5 years ago
- ☆62Updated 7 years ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆109Updated 2 years ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆162Updated 3 years ago
- Pytorch implementation of distributed deep reinforcement learning☆76Updated 3 years ago
- Soft Actor-Critic☆156Updated 7 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆84Updated 4 years ago
- ☆203Updated 2 years ago
- ☆78Updated last year
- Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆194Updated 2 years ago
- Code for ICLR 2019 paper: Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks☆219Updated 2 years ago
- ☆131Updated last year
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning☆129Updated 4 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆130Updated 2 years ago
- ☆122Updated 2 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆268Updated 5 years ago
- Author's PyTorch implementation of LAP and PAL with TD3 and DDQN☆37Updated 3 years ago
- ☆114Updated 2 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆189Updated 3 years ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Me…☆244Updated 3 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Updated 8 months ago
- impact-driven-exploration☆132Updated 2 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆137Updated 2 months ago
- Decoupled Reward-free ExplorAtion and Execution for Meta-reinforcement learning☆90Updated 2 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆223Updated last year
- Code release for Learning with Opponent-Learning Awareness and variations.☆151Updated 2 years ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆151Updated 4 years ago