Reinforcement learning toolbox; contains TRPO and A3C algorithms for continuous action spaces
☆41 · Jan 27, 2018 · Updated 8 years ago
Alternatives and similar repositories for RL_toolbox
Users who are interested in RL_toolbox are comparing it to the libraries listed below.
- Trust region policy optimization based on Gym and TensorFlow; can run in distributed mode ☆15 · May 6, 2017 · Updated 8 years ago
- Reimplementation of the DDPG algorithm using TensorFlow ☆37 · Oct 17, 2016 · Updated 9 years ago
- ☆58 · Aug 28, 2018 · Updated 7 years ago
- ☆25 · Sep 7, 2017 · Updated 8 years ago
- TensorFlow implementation of Deep Reinforcement Learning papers ☆28 · Dec 31, 2016 · Updated 9 years ago
- Exploration Strategies for Deep Reinforcement Learning ☆39 · Oct 31, 2018 · Updated 7 years ago
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu… ☆12 · Mar 17, 2021 · Updated 5 years ago
- TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016) ☆215 · Feb 16, 2018 · Updated 8 years ago
- PyTorch code for DeepTime: Deep Time-Index Meta-Learning for Non-Stationary Time-Series Forecasting ☆11 · Jan 9, 2023 · Updated 3 years ago
- ☆17 · Sep 15, 2017 · Updated 8 years ago
- ☆55 · Dec 7, 2022 · Updated 3 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym ☆362 · Jun 2, 2020 · Updated 5 years ago
- Contains Jupyter notebooks associated with the "Deep Reinforcement Learning Tutorial" tutorial given at the O'Reilly 2017 NYC AI Conferen… ☆277 · Jan 16, 2020 · Updated 6 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters" ☆155 · Sep 22, 2017 · Updated 8 years ago
- TensorFlow implementation of the A3C algorithm ☆46 · Jul 4, 2017 · Updated 8 years ago
- Distributed TensorFlow implementation of Asynchronous Methods for Deep Reinforcement Learning ☆29 · Dec 26, 2017 · Updated 8 years ago
- Persistent advantage learning dueling double DQN for the Arcade Learning Environment ☆263 · Feb 8, 2018 · Updated 8 years ago
- Robust policy search algorithms which train on model ensembles ☆31 · Oct 26, 2016 · Updated 9 years ago
- Biased matrix factorisation using TensorFlow ☆19 · Jun 30, 2016 · Updated 9 years ago
- Distributed A3C ☆34 · Dec 22, 2017 · Updated 8 years ago
- ☆76 · Jun 7, 2017 · Updated 8 years ago
- This is a repository for machine translation with open license. ☆24 · Dec 1, 2015 · Updated 10 years ago
- Asynchronous Advantage Actor Critic