openai / mlsh
Code for the paper "Meta-Learning Shared Hierarchies"
☆614 · Updated last year
Related projects
Alternatives and complementary repositories for mlsh
- Code for the paper "Generative Adversarial Imitation Learning" ☆690 · Updated 5 years ago
- Efficient Batched Reinforcement Learning in TensorFlow ☆968 · Updated 5 years ago
- Implementation of TRPO and related algorithms ☆620 · Updated 6 years ago
- A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures. ☆987 · Updated 5 years ago
- Code for the paper "Emergent Complexity via Multi-agent Competition" ☆808 · Updated last year
- [ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning ☆1,419 · Updated last year
- Learning to Communicate with Deep Multi-Agent Reinforcement Learning ☆436 · Updated 5 years ago
- Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks" ☆341 · Updated 5 years ago
- A highly-customisable gridworld game engine with some batteries included. Make your own gridworld games to test reinforcement learning ag… ☆659 · Updated 5 years ago
- Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017) ☆1,099 · Updated 7 years ago
- ☆305 · Updated last year
- Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments" ☆302 · Updated last year
- Code for the paper "Large-Scale Study of Curiosity-Driven Learning" ☆804 · Updated 3 years ago
- Asynchronous Methods for Deep Reinforcement Learning ☆592 · Updated 6 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym ☆360 · Updated 4 years ago
- Implementation of Meta-RL A3C algorithm ☆400 · Updated 7 years ago
- Lua/Torch implementation of DQN (Nature, 2015) ☆589 · Updated 7 years ago
- Code for the paper "Exploration by Random Network Distillation" ☆879 · Updated 4 years ago
- Reinforcement learning with unsupervised auxiliary tasks ☆416 · Updated 5 years ago
- Reinforcement Learning with Deep Energy-Based Policies ☆416 · Updated 11 months ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package ☆267 · Updated 5 years ago
- TensorFlow implementation of the Value Iteration Networks (NIPS '16) paper ☆552 · Updated 5 years ago
- Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning. ☆655 · Updated 4 years ago
- Persistent advantage learning dueling double DQN for the Arcade Learning Environment ☆264 · Updated 6 years ago
- Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783) ☆400 · Updated 7 years ago
- Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning" ☆1,565 · Updated 5 years ago
- Implementations of Reinforcement Learning Models in TensorFlow ☆487 · Updated 7 years ago
- A packaged and slightly-modified version of https://github.com/bbitmaster/ale_python_interface ☆372 · Updated last year
- A starter agent that can solve a number of universe environments. ☆1,101 · Updated 6 years ago
- Code for the paper "Evolved Policy Gradients" ☆249 · Updated 5 years ago