facebookresearch / jps
Code for "Joint Policy Search for Collaborative Multi-agent Incomplete Information Games"
☆51Updated last year
Alternatives and similar repositories for jps:
Users that are interested in jps are comparing it to the libraries listed below
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆44Updated 4 years ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆99Updated 2 years ago
- A Multi-agent Learning Framework☆62Updated 3 years ago
- Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)☆44Updated last year
- ☆120Updated 2 years ago
- PyTorch RL for Pommerman☆38Updated 6 years ago
- A new paper list for multi-agent reinforcement learning (actively maintained)☆25Updated 4 years ago
- ☆73Updated 8 months ago
- PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning☆49Updated 3 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019☆47Updated 5 years ago
- PyTorch IMPALA implementation☆25Updated 5 years ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variables☆71Updated last year
- Diversity−Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019.☆48Updated 5 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆92Updated 6 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆85Updated 3 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆128Updated last year
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆93Updated 2 years ago
- ☆107Updated 5 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆55Updated 5 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆115Updated 3 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆47Updated 5 months ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆69Updated last year
- Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)☆34Updated 3 years ago
- Implicit Normalizing Flows + Reinforcement Learning☆60Updated 5 years ago
- PyTorch implementation of our paper Real-Time Reinforcement Learning (NeurIPS 2019)☆73Updated 4 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 5 years ago
- ☆97Updated last year
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 4 years ago
- ☆120Updated 6 months ago
- FEN Code☆37Updated 5 years ago