wwxFromTju / sc2-101-zh
just for fun
☆23Updated 7 years ago
Alternatives and similar repositories for sc2-101-zh:
Users that are interested in sc2-101-zh are comparing it to the libraries listed below
- Implementation of PPO-clip and PPO-penalty on Atari, described as the only open-source implementation of PPO-penalty☆56Updated 6 years ago
- PyTorch implementation of the Asynchronous Advantage Actor-Critic (A3C) algorithm☆114Updated 7 years ago
- A Multi-agent Learning Framework☆62Updated 3 years ago
- An implementation of DQfD (Deep Q-learning from Demonstrations) proposed by DeepMind: Learning from Demonstrations for Real World Reinforcement Le…☆133Updated 7 years ago
- Ape-X DQN & DDPG with PyTorch & TensorBoard☆103Updated 5 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019☆47Updated 5 years ago
- Some multi-agent environments from "Multi-agent Reinforcement Learning in Sequential Social Dilemmas" and "Value-Decomposition Networks For Co…☆129Updated 2 years ago
- Random Network Distillation (RND) algorithm in PyTorch☆49Updated 6 years ago
- Multi-Agent Determinantal Q-Learning☆42Updated 2 years ago
- ☆3Updated 3 months ago
- Efficient Reinforcement Learning with a Thought-Game for StarCraft☆46Updated 2 years ago
- Diversity-Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019.☆48Updated 6 years ago
- A TensorFlow implementation of the Option-Critic Architecture☆72Updated 7 years ago
- ☆33Updated 7 years ago
- Hindsight Experience Replay: bit-flipping experiment in TensorFlow☆58Updated 6 years ago
- Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation☆87Updated 7 years ago
- NIPS 2017 Value Prediction Network☆165Updated 7 years ago
- TensorFlow implementation of Generative Adversarial Imitation Learning☆199Updated 6 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Updated 2 weeks ago
- Collection of Deep Reinforcement Learning algorithms☆124Updated 7 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆65Updated 5 years ago
- PyTorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆181Updated 6 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆129Updated last year
- ☆32Updated 4 years ago
- An implementation of FeUdal Networks for Hierarchical Reinforcement Learning, as published at https://arxiv.org/abs/1703.01161☆182Updated 7 years ago
- TensorFlow implementation of the DeepMind paper "Learning to Navigate in Complex Environments"☆63Updated 7 years ago
- Reinforcement learning with unsupervised auxiliary tasks☆23Updated 6 years ago
- Stochastic Neural Networks for Hierarchical Reinforcement Learning☆97Updated 6 years ago
- Implementation of selected reinforcement learning algorithms in TensorFlow: A3C, DDPG, REINFORCE, DQN, etc.☆150Updated last year
- CommNet and BiCNet implementation in TensorFlow☆55Updated 6 years ago