ciwang / policydistillation
Reproducing Policy Distillation (DeepMind paper ICLR 2016)
☆21 Updated 4 years ago
Related projects
Alternatives and complementary repositories for policydistillation
- Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy" ☆33 Updated 5 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019 ☆48 Updated 5 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP ☆28 Updated 3 years ago
- Multi-Agent Determinantal Q-Learning ☆41 Updated last year
- ☆28 Updated 2 years ago
- PyTorch IMPALA implementation ☆24 Updated 5 years ago
- Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning ☆22 Updated 4 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019. ☆54 Updated 5 years ago
- PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning ☆49 Updated 3 years ago
- Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918… ☆40 Updated 5 years ago
- ☆43 Updated last year
- Hierarchical Deep RL Network ☆30 Updated 7 years ago
- PyTorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge ☆92 Updated 2 years ago
- Diversity-Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019. ☆48 Updated 5 years ago
- A TensorFlow implementation of the Option-Critic Architecture ☆70 Updated 7 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019) ☆33 Updated 4 years ago
- Unified Model-Free Hierarchical Reinforcement Learning Framework ☆37 Updated 5 years ago
- Code for training policies based on the paper "Coordinated Multi-Agent Imitation Learning" ☆26 Updated 7 years ago
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control" ☆21 Updated 4 years ago
- Self-implemented code for Model-Based Meta-Reinforcement Learning ☆17 Updated 5 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards ☆25 Updated 5 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019) ☆66 Updated 5 years ago
- ☆82 Updated 5 years ago
- Disagreement-Regularized Imitation Learning ☆30 Updated 3 years ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards ☆29 Updated last year
- A Multi-agent Learning Framework ☆62 Updated 3 years ago
- This is a PyTorch implementation of Distributed Proximal Policy Optimization (DPPO). ☆62 Updated 6 years ago
- ☆59 Updated 6 years ago
- PyTorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control" ☆51 Updated last year