facebookresearch / starcraft_defogger
Forward Modeling for Partial Observation Strategy Games - A StarCraft Defogger
☆31 · Updated 3 years ago
Related projects
Alternatives and complementary repositories for starcraft_defogger
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning ☆21 · Updated 6 years ago
- An Empirical Analysis of Gradient Descent Optimization in Policy Gradient Methods - EWRL Workshop 2018 ☆15 · Updated 6 years ago
- Separating value functions across time-scales. ☆17 · Updated 5 years ago
- Reinforcement Learning Assembly ☆92 · Updated 3 years ago
- A squad movement planning library for StarCraft AI using Monte Carlo Tree Search and Negamax ☆14 · Updated 5 years ago
- Distributed implementation of popular evolutionary methods ☆64 · Updated 6 years ago
- A2C for GVG-AI ☆21 · Updated 6 years ago
- ☆30 · Updated 6 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees ☆93 · Updated 5 years ago
- Decoupling Dynamics and Reward for Transfer Learning ☆16 · Updated 6 years ago
- A grid-world platform that supports up to 1 million reinforcement-learning agents. ☆70 · Updated 7 years ago
- PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098) ☆90 · Updated 6 years ago
- ☆42 · Updated 5 years ago
- Easing non-convex optimization with neural networks. ☆22 · Updated 6 years ago
- Implementation of Neural Episodic Control in TensorFlow ☆26 · Updated 5 years ago
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w… ☆31 · Updated 6 years ago
- Deep Reinforcement Learning with Fine Grained Action Repetition ☆23 · Updated 6 years ago
- Solves AI, transcends reality, infiltrates your mind ☆36 · Updated 7 years ago
- TensorFlow Implementation of Programmable Agents ☆36 · Updated 7 years ago
- Mind-aware Multi-agent Management Reinforcement Learning ☆81 · Updated 5 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning ☆10 · Updated 6 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation ☆30 · Updated 5 years ago
- E2C implementation in PyTorch ☆43 · Updated 7 years ago
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future ☆50 · Updated 5 years ago
- Explore the optimization landscape for direct policy learning in reinforcement learning. ☆50 · Updated 5 years ago
- Dataset for the spaceship task from "Metacontrol for Adaptive Imagination-Based Optimization" ☆55 · Updated 7 years ago
- TargetProp for RNNs ☆28 · Updated 5 years ago
- Training Sonic with RLlib ☆57 · Updated last year