pat-coady / trpoView external linksLinks
Trust Region Policy Optimization with TensorFlow and OpenAI Gym
☆361Jun 2, 2020Updated 5 years ago
Alternatives and similar repositories for trpo
Users that are interested in trpo are comparing it to the libraries listed below
Sorting:
- Implementation of TRPO and related algorithms☆646May 20, 2018Updated 7 years ago
- ☆101Aug 15, 2016Updated 9 years ago
- rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.☆3,040Jun 10, 2023Updated 2 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆435Nov 28, 2023Updated 2 years ago
- PyTorch implementation of Trust Region Policy Optimization☆450Sep 13, 2018Updated 7 years ago
- ☆160Jul 21, 2017Updated 8 years ago
- Noisy Networks for Exploration☆187Jan 28, 2018Updated 8 years ago
- Efficient Batched Reinforcement Learning in TensorFlow☆975Jan 11, 2019Updated 7 years ago
- TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016)☆215Feb 16, 2018Updated 7 years ago
- Implementations of deep RL papers and random experimentation☆178Apr 7, 2018Updated 7 years ago
- Guided Policy Search☆603Feb 9, 2021Updated 5 years ago
- Code for the paper "Generative Adversarial Imitation Learning"☆729Nov 22, 2018Updated 7 years ago
- Continuous control with deep reinforcement learning - Deep Deterministic Policy Gradient (DDPG) algorithm implemented in OpenAI Gym envir…☆276Mar 22, 2018Updated 7 years ago
- [ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning☆1,471Dec 7, 2022Updated 3 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Sep 13, 2019Updated 6 years ago
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆42Jan 27, 2018Updated 8 years ago
- DEPRECATED: Open-source software for robot simulation, integrated with OpenAI Gym.☆2,166Apr 2, 2023Updated 2 years ago
- Reinforcement learning environments with musculoskeletal models☆942Jan 24, 2022Updated 4 years ago
- Publicly releasable baselines for the Retro contest☆129Nov 22, 2018Updated 7 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆133May 5, 2019Updated 6 years ago
- The Winning Solution for the Learning To Run Challenge 2017☆60Jul 4, 2018Updated 7 years ago
- ICML 2018 Self-Imitation Learning☆278Apr 18, 2020Updated 5 years ago
- Multitask Environments for RL☆281Aug 23, 2021Updated 4 years ago
- Tensorforce: a TensorFlow library for applied reinforcement learning☆3,312Jul 31, 2024Updated last year
- pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction☆80Jan 5, 2019Updated 7 years ago
- A parallel version of Trust Region Policy Optimization☆65Mar 6, 2017Updated 8 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆155Sep 22, 2017Updated 8 years ago
- Implementation of algorithms for continuous control (DDPG and NAF).☆313Feb 16, 2021Updated 5 years ago
- ☆119Jul 9, 2020Updated 5 years ago
- Soft Actor-Critic☆1,210Nov 29, 2023Updated 2 years ago
- OpenAI Baselines: high-quality implementations of reinforcement learning algorithms☆16,651Aug 1, 2024Updated last year
- Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)☆1,123Oct 13, 2017Updated 8 years ago
- Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)☆408Feb 25, 2017Updated 8 years ago
- Open-source implementations of OpenAI Gym MuJoCo environments for use with the OpenAI Gym Reinforcement Learning Research Platform.☆876Oct 16, 2021Updated 4 years ago
- Actor-critic with experience replay☆256Oct 9, 2022Updated 3 years ago
- Collection of Deep Reinforcement Learning algorithms☆300Mar 19, 2019Updated 6 years ago
- Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning.☆661Feb 25, 2020Updated 5 years ago
- ☆32Apr 27, 2017Updated 8 years ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆374Oct 15, 2021Updated 4 years ago