cqian19 / qmix-plus
Improving upon state-of-the-art cooperative deep reinforcement learning in StarCraft II
☆13 Updated 5 years ago
Related projects
Alternatives and complementary repositories for qmix-plus
- Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875) ☆33 Updated 3 years ago
- FEN Code ☆37 Updated 5 years ago
- Open source demo for the paper "Learning to Score Behaviors for Guided Policy Optimization" ☆24 Updated 4 years ago
- PyTorch IMPALA implementation ☆24 Updated 5 years ago
- Implementation for the paper "Exploration via Hierarchical Meta Reinforcement Learning" ☆60 Updated 5 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation ☆36 Updated 3 weeks ago
- Random Network Distillation (RND) algorithm in PyTorch ☆48 Updated 5 years ago
- A new paper list for multi-agent reinforcement learning (actively maintained) ☆25 Updated 4 years ago
- Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy" ☆33 Updated 5 years ago
- ☆18 Updated 4 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019) ☆23 Updated 5 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning ☆27 Updated 5 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019) ☆33 Updated 4 years ago
- GitHub repo for CARL: Cautious Adaptation for RL in Safety Critical Settings ☆14 Updated 2 years ago
- PyTorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge ☆93 Updated 2 years ago
- ☆15 Updated 3 months ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x ☆62 Updated 3 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 Updated 4 years ago
- Implementation for the ICML 2019 paper "EMI: Exploration with Mutual Information" ☆36 Updated 3 years ago
- Taming MAML: efficient unbiased meta-reinforcement learning ☆29 Updated 2 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid ☆12 Updated 5 years ago
- A Multi-agent Learning Framework ☆62 Updated 3 years ago
- Multi-Agent Determinantal Q-Learning ☆42 Updated 2 years ago
- Reproducing Policy Distillation (DeepMind paper, ICLR 2016) ☆21 Updated 4 years ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO) ☆20 Updated last year
- Exploration by Random Network Distillation ☆16 Updated 5 years ago
- Implementation of the Truncated Quantile Critics method for continuous reinforcement learning ☆19 Updated last year
- Proximal policy optimization in PyTorch. Easy to read and understand. ☆48 Updated 4 years ago
- Multi-Objective Deep Reinforcement Learning ☆41 Updated 7 years ago
- A TensorFlow implementation of the Option-Critic Architecture ☆70 Updated 7 years ago