☆101Aug 15, 2016Updated 9 years ago
Alternatives and similar repositories for trpo
Users that are interested in trpo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18Apr 25, 2016Updated 9 years ago
- Implementation of TRPO and related algorithms☆649May 20, 2018Updated 7 years ago
- A parallel version of Trust Region Policy Optimization☆65Mar 6, 2017Updated 9 years ago
- Playground for reinforcement learning algorithms implemented in TensorFlow☆16Oct 18, 2016Updated 9 years ago
- ☆20Apr 27, 2016Updated 9 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆15Sep 5, 2016Updated 9 years ago
- trust region policy optimization base on gym and tensorflow, can run in distribution mode☆15May 6, 2017Updated 8 years ago
- ☆160Jul 21, 2017Updated 8 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym☆361Jun 2, 2020Updated 5 years ago
- Implementations of deep RL papers and random experimentation☆178Apr 7, 2018Updated 8 years ago
- TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016)☆215Feb 16, 2018Updated 8 years ago
- some RL algorithms☆19Dec 9, 2016Updated 9 years ago
- Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"☆348Nov 22, 2018Updated 7 years ago
- rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.☆3,055Jun 10, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- PyTorch implementation of Trust Region Policy Optimization☆451Sep 13, 2018Updated 7 years ago
- Code for the paper "Generative Adversarial Imitation Learning"☆731Nov 22, 2018Updated 7 years ago
- Trust Region Policy Optimization with Generalized Advantage Estimator☆16Nov 15, 2018Updated 7 years ago
- "Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow☆193Jul 20, 2018Updated 7 years ago
- reinforcement learning. policy gradient. PCL☆37Apr 25, 2017Updated 8 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆95Apr 7, 2018Updated 8 years ago
- Training Sonic with RLlib☆62Apr 2, 2023Updated 3 years ago
- pybullet_animations☆12Nov 13, 2017Updated 8 years ago
- Noisy Networks for Exploration☆187Jan 28, 2018Updated 8 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- TensorFlow implementation of the Value Iteration Networks (NIPS '16) paper☆551Mar 7, 2019Updated 7 years ago
- Model-Free Episodic Control☆14Jan 12, 2017Updated 9 years ago
- Guided Policy Search☆602Feb 9, 2021Updated 5 years ago
- Tensorflow Implementation of Multi-Function Recurrent Unit☆23Jun 13, 2016Updated 9 years ago
- Implementation of algorithms for continuous control (DDPG and NAF).☆313Feb 16, 2021Updated 5 years ago
- Training neural networks with back-prop, feedback-alignment and direct feedback-alignment☆105Jan 15, 2018Updated 8 years ago
- Implement A3C for Mujoco gym envs☆73Nov 2, 2017Updated 8 years ago
- Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)☆408Feb 25, 2017Updated 9 years ago
- Implementation of the paper [Using Fast Weights to Attend to the Recent Past](https://arxiv.org/abs/1610.06258)☆174Nov 3, 2016Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆25Oct 22, 2015Updated 10 years ago
- Asynchronous Advantage Actor Critic☆20Aug 15, 2016Updated 9 years ago
- Asynchronous Methods for Deep Reinforcement Learning☆588Aug 9, 2018Updated 7 years ago
- Deterministic Policy Gradient using torch7☆43Jun 2, 2016Updated 9 years ago
- Reinforcement learning with unsupervised auxiliary tasks☆422Feb 13, 2019Updated 7 years ago
- Implementation of a simple example of Q learning in Torch.☆51Mar 5, 2017Updated 9 years ago
- Robust policy search algorithms which train on model ensembles☆31Oct 26, 2016Updated 9 years ago