ciwang / policydistillation
Reproducing Policy Distillation (DeepMind paper ICLR 2016)
☆21 Updated 4 years ago
Related projects
Alternatives and complementary repositories for policydistillation
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019) ☆33 Updated 4 years ago
- PyTorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge ☆93 Updated 2 years ago
- ☆71 Updated 5 months ago
- Hierarchical Deep RL Network ☆31 Updated 7 years ago
- Multi-Agent Determinantal Q-Learning ☆42 Updated 2 years ago
- Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy" ☆33 Updated 5 years ago
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning ☆32 Updated 5 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019 ☆48 Updated 5 years ago
- Code repository for SARNet: Learning Multi-Agent Communication through Structured Attentive Reasoning (NeurIPS 2020) ☆24 Updated 3 years ago
- A PyTorch Implementation of Multi Agent Soft Actor Critic ☆35 Updated 5 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP ☆28 Updated 3 years ago
- ☆54 Updated 8 months ago
- A Tensorflow implementation of the Option-Critic Architecture ☆70 Updated 7 years ago
- ☆28 Updated 2 years ago
- ☆83 Updated 5 years ago
- PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning ☆49 Updated 3 years ago
- ☆25 Updated 6 years ago
- ☆43 Updated last year
- Diversity-Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019. ☆48 Updated 5 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021) ☆32 Updated 2 years ago
- A Multi-agent Learning Framework ☆62 Updated 3 years ago
- ☆59 Updated 6 years ago
- PyTorch IMPALA implementation ☆24 Updated 5 years ago
- Code for training policies based on paper Coordinated Multi-Agent Imitation Learning ☆26 Updated 7 years ago
- PyTorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control" ☆51 Updated last year
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards ☆30 Updated last year
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL… ☆51 Updated 3 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017. ☆38 Updated 4 years ago
- Implementation of the Option-Critic Architecture ☆36 Updated 5 years ago