wwxFromTju / sc2-101-zh
just for fun
☆23Updated 7 years ago
Alternatives and similar repositories for sc2-101-zh:
Users who are interested in sc2-101-zh are comparing it to the repositories listed below
- Some multi-agent environments from 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…☆131Updated 2 years ago
- Implements PPO-clip and PPO-penalty on Atari; described as the only open-source implementation of PPO-penalty☆56Updated 6 years ago
- A3C implementation for MuJoCo Gym environments☆72Updated 7 years ago
- PyTorch implementation of the Asynchronous Advantage Actor-Critic (A3C) algorithm☆113Updated 7 years ago
- Efficient Reinforcement Learning with a Thought-Game for StarCraft☆46Updated 2 years ago
- Advantage actor-critic reinforcement learning for OpenAI Gym CartPole☆63Updated 7 years ago
- ☆53Updated 8 years ago
- PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.☆15Updated 7 years ago
- An implementation of DQfD (Deep Q-learning from Demonstrations), proposed by DeepMind: Learning from Demonstrations for Real World Reinforcement Le…☆132Updated 7 years ago
- Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"☆37Updated 5 years ago
- TensorFlow implementation of the DeepMind paper "Learning to Navigate in Complex Environments"☆63Updated 7 years ago
- NIPS 2017 Value Prediction Network☆165Updated 7 years ago
- ☆3Updated 2 months ago
- Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation☆87Updated 6 years ago
- Implementation of Multi-Agent Deep Deterministic Policy Gradients☆36Updated 6 years ago
- Converts the sc2 environment to gym-atari and plays some mini-games☆21Updated 7 years ago
- PyTorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆181Updated 6 years ago
- Hierarchical Deep RL Network☆31Updated 7 years ago
- A TensorFlow implementation of DeepMind's "A Distributional Perspective on Reinforcement Learning" (C51-DQN)☆56Updated 7 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆65Updated 5 years ago
- ☆32Updated 4 years ago
- ☆33Updated 7 years ago
- CommNet and BiCNet implementations in TensorFlow☆55Updated 6 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆101Updated last month
- Ape-X DQN & DDPG with PyTorch & TensorBoard☆103Updated 5 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019☆47Updated 5 years ago
- Multi-Agent Determinantal Q-Learning☆42Updated 2 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 7 years ago
- An implementation of FeUdal Networks for Hierarchical Reinforcement Learning, as published at https://arxiv.org/abs/1703.01161☆180Updated 7 years ago
- A TensorFlow implementation of the Option-Critic Architecture☆71Updated 7 years ago