Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"
☆348Nov 22, 2018Updated 7 years ago
Alternatives and similar repositories for vime
Users that are interested in vime are comparing it to the libraries listed below
Sorting:
- Code for the paper "Generative Adversarial Imitation Learning"☆730Nov 22, 2018Updated 7 years ago
- Code for reproducing key results in the paper "Improving Variational Inference with Inverse Autoregressive Flow"☆529Nov 22, 2018Updated 7 years ago
- Implementation of TRPO and related algorithms☆647May 20, 2018Updated 7 years ago
- Code for reproducing key results in the paper "InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Advers…☆1,071Mar 25, 2021Updated 4 years ago
- rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.☆3,045Jun 10, 2023Updated 2 years ago
- Code for the Neural GPU model originally described in "Neural GPUs Learn Algorithms"☆145Nov 22, 2018Updated 7 years ago
- [ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning☆1,470Dec 7, 2022Updated 3 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆436Nov 28, 2023Updated 2 years ago
- Code for the paper "Evolved Policy Gradients"☆253Nov 22, 2018Updated 7 years ago
- Code for the paper "Improved Techniques for Training GANs"☆2,334Nov 21, 2018Updated 7 years ago
- A starter agent that can solve a number of universe environments.☆1,105Apr 7, 2018Updated 7 years ago
- Noisy Networks for Exploration☆187Jan 28, 2018Updated 8 years ago
- Code for the paper "Large-Scale Study of Curiosity-Driven Learning"☆830Aug 12, 2021Updated 4 years ago
- ☆101Aug 15, 2016Updated 9 years ago
- Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"☆309Apr 13, 2023Updated 2 years ago
- "Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow☆193Jul 20, 2018Updated 7 years ago
- Reinforcement learning with unsupervised auxiliary tasks☆423Feb 13, 2019Updated 7 years ago
- Wikipedia navigation environment for OpenAI Gym☆41Apr 2, 2023Updated 2 years ago
- Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)☆408Feb 25, 2017Updated 9 years ago
- Guided Policy Search☆604Feb 9, 2021Updated 5 years ago
- Asynchronous Methods for Deep Reinforcement Learning☆591Aug 9, 2018Updated 7 years ago
- Code for the paper "Improving GANs Using Optimal Transport"☆73Nov 22, 2018Updated 7 years ago
- ViZDoom Python wrapper☆75Apr 2, 2023Updated 2 years ago
- TensorFlow implementation of the Value Iteration Networks (NIPS '16) paper☆552Mar 7, 2019Updated 6 years ago
- Value Iteration Networks☆291Apr 21, 2017Updated 8 years ago
- Torch7 impementation of: Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images☆43Jan 12, 2016Updated 10 years ago
- pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction☆80Jan 5, 2019Updated 7 years ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Me…☆248Sep 30, 2022Updated 3 years ago
- Code for the paper "Meta-Learning Shared Hierarchies"☆617Jul 6, 2023Updated 2 years ago
- Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"☆1,625Oct 31, 2019Updated 6 years ago
- Retask is a simple task queue implementation written for human beings. It provides generic solution to create and manage task queues.☆19Feb 8, 2017Updated 9 years ago
- Efficient Batched Reinforcement Learning in TensorFlow☆973Jan 11, 2019Updated 7 years ago
- Python wrappers for Pachi. Contains a modified version of the bleeding-edge Pachi source code.☆41Apr 2, 2023Updated 2 years ago
- Model-Free Episodic Control☆14Jan 12, 2017Updated 9 years ago
- Persistent advantage learning dueling double DQN for the Arcade Learning Environment☆263Feb 8, 2018Updated 8 years ago
- A parallel version of Trust Region Policy Optimization☆65Mar 6, 2017Updated 8 years ago
- Implementation of "Action-Conditional Video Prediction using Deep Networks in Atari Games"☆114Feb 8, 2016Updated 10 years ago
- Add-on package to gym, to record sequences of actions, observations, and rewards☆75Apr 2, 2023Updated 2 years ago
- ICML 2018 Self-Imitation Learning☆278Apr 18, 2020Updated 5 years ago