Trust Region Policy Optimization with TensorFlow and OpenAI Gym
☆361Jun 2, 2020Updated 5 years ago
Alternatives and similar repositories for trpo
Users that are interested in trpo are comparing it to the libraries listed below
Sorting:
- Implementation of TRPO and related algorithms☆647May 20, 2018Updated 7 years ago
- ☆101Aug 15, 2016Updated 9 years ago
- rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.☆3,048Jun 10, 2023Updated 2 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆436Nov 28, 2023Updated 2 years ago
- PyTorch implementation of Trust Region Policy Optimization☆450Sep 13, 2018Updated 7 years ago
- ☆160Jul 21, 2017Updated 8 years ago
- Noisy Networks for Exploration☆187Jan 28, 2018Updated 8 years ago
- Efficient Batched Reinforcement Learning in TensorFlow☆974Jan 11, 2019Updated 7 years ago
- TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016)☆215Feb 16, 2018Updated 8 years ago
- Implementations of deep RL papers and random experimentation☆178Apr 7, 2018Updated 7 years ago
- Guided Policy Search☆604Feb 9, 2021Updated 5 years ago
- Code for the paper "Generative Adversarial Imitation Learning"☆730Nov 22, 2018Updated 7 years ago
- Continuous control with deep reinforcement learning - Deep Deterministic Policy Gradient (DDPG) algorithm implemented in OpenAI Gym envir…☆276Mar 22, 2018Updated 7 years ago
- [ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning☆1,471Dec 7, 2022Updated 3 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Sep 13, 2019Updated 6 years ago
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆42Jan 27, 2018Updated 8 years ago
- DEPRECATED: Open-source software for robot simulation, integrated with OpenAI Gym.☆2,167Apr 2, 2023Updated 2 years ago
- Reinforcement learning environments with musculoskeletal models☆943Jan 24, 2022Updated 4 years ago
- Publicly releasable baselines for the Retro contest☆130Nov 22, 2018Updated 7 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆133May 5, 2019Updated 6 years ago
- The Winning Solution for the Learning To Run Challenge 2017☆60Jul 4, 2018Updated 7 years ago
- ICML 2018 Self-Imitation Learning☆278Apr 18, 2020Updated 5 years ago
- Multitask Environments for RL☆282Aug 23, 2021Updated 4 years ago
- Tensorforce: a TensorFlow library for applied reinforcement learning☆3,311Jul 31, 2024Updated last year
- pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction☆80Jan 5, 2019Updated 7 years ago
- A parallel version of Trust Region Policy Optimization☆65Mar 6, 2017Updated 9 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆155Sep 22, 2017Updated 8 years ago
- Implementation of algorithms for continuous control (DDPG and NAF).☆313Feb 16, 2021Updated 5 years ago
- ☆119Jul 9, 2020Updated 5 years ago
- Soft Actor-Critic☆1,222Nov 29, 2023Updated 2 years ago
- OpenAI Baselines: high-quality implementations of reinforcement learning algorithms☆16,651Aug 1, 2024Updated last year
- Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)☆1,122Oct 13, 2017Updated 8 years ago
- Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)☆408Feb 25, 2017Updated 9 years ago
- Open-source implementations of OpenAI Gym MuJoCo environments for use with the OpenAI Gym Reinforcement Learning Research Platform.☆877Oct 16, 2021Updated 4 years ago
- Actor-critic with experience replay☆257Oct 9, 2022Updated 3 years ago
- Collection of Deep Reinforcement Learning algorithms☆300Mar 19, 2019Updated 6 years ago
- Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning.☆662Feb 25, 2020Updated 6 years ago
- ☆32Apr 27, 2017Updated 8 years ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆374Oct 15, 2021Updated 4 years ago