Tencent / TStarBot1
☆48Updated 2 years ago
Alternatives and similar repositories for TStarBot1:
Users that are interested in TStarBot1 are comparing it to the libraries listed below
- ☆42Updated 3 years ago
- ☆71Updated 6 years ago
- ☆25Updated 4 years ago
- ☆4Updated 4 months ago
- This is the code for "OpenAI Five vs DOTA 2 Explained" By Siraj Raval on Youtube☆160Updated 6 years ago
- Reinforcement Learning and Transfer Learning based StarCraft Micromanagement☆102Updated 7 years ago
- Reinforcement Learning and Transfer Learning based StarCraft Micromanagement☆45Updated 7 years ago
- Efficient Reinforcement Learning with a Thought-Game for StarCraft☆46Updated 2 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard☆103Updated 5 years ago
- Reinforcement learning framework to accelerate research☆204Updated 3 years ago
- some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…☆128Updated 2 years ago
- ☆143Updated 4 months ago
- An implement of DQfD(Deep Q-learning from Demonstrations) raised by DeepMind:Learning from Demonstrations for Real World Reinforcement Le…☆133Updated 7 years ago
- Keeping track of RL experiments☆162Updated 2 years ago
- (TG'2021) Code for paper "Efficient Reinforcement Learning for StarCraft by Abstract Forward Models and Transfer Learning". TG = Transact…☆10Updated last year
- This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"☆189Updated 6 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆131Updated last year
- advantage actor-critic reinforcement learning for openai gym cartpole☆65Updated 7 years ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆183Updated 7 years ago
- Implementing reinforcement-learning algorithms for pysc2 -environment☆89Updated 7 years ago
- Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty☆56Updated 6 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆180Updated 6 years ago
- NIPS 2017 Value Prediction Network☆166Updated 7 years ago
- Random Network Distillation pytorch☆247Updated 6 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆105Updated 5 years ago
- ☆33Updated 7 years ago
- MSC: A Dataset for Macro-Management in StarCraft II☆138Updated 2 years ago
- This project is implementation code of AlphaStar☆199Updated last year
- Fictitious Self-play & Reinforcement Learning☆18Updated 7 years ago
- A continuous action space version of A3C LSTM in pytorch plus A3G design☆259Updated 6 months ago