jjkke88 / trpo
Trust region policy optimization based on Gym and TensorFlow; can run in distributed mode
☆15May 6, 2017Updated 8 years ago
Alternatives and similar repositories for trpo
Users that are interested in trpo are comparing it to the libraries listed below
- Reinforcement learning toolbox, contains TRPO and A3C algorithms for continuous action spaces☆42Jan 27, 2018Updated 8 years ago
- ☆101Aug 15, 2016Updated 9 years ago
- Content Addressable Memory using dimensionality reduction☆13Apr 22, 2017Updated 8 years ago
- Recurrent Network-based Deterministic Policy Gradient for Solving Bipedal Walking Challenge on Rugged Terrains☆12Oct 16, 2017Updated 8 years ago
- Deep Reinforcement Learning☆17Sep 1, 2017Updated 8 years ago
- Modeling uncertainty information in deep learning☆22Jan 11, 2018Updated 8 years ago
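- Thanks to @greyireland for the template; I rewrote it in C++, reviewing some data structures and algorithms along the way, in preparation for practicing coding problems and job hunting☆20Apr 21, 2021Updated 4 years ago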
- ☆20Apr 27, 2016Updated 9 years ago
- Implementation of TRPO and related algorithms☆646May 20, 2018Updated 7 years ago
- ☆19Apr 25, 2016Updated 9 years ago
- Reference implementation for Structured Prediction with Deep Value Networks☆54Jul 10, 2017Updated 8 years ago
- A Siamese network implementation in torch (simple example on MNIST to embed to 2D space)☆23Aug 4, 2015Updated 10 years ago
- Data / annotations for video co-summarization (CVPR15)☆30Jan 3, 2017Updated 9 years ago
- Quadrotor simulator intended mainly for training a neural network to control quadrotor flight via the deep Q-learning algorithm☆27Aug 5, 2022Updated 3 years ago
- Facial-Expression Recognition with Deep Neural Networks☆10Mar 6, 2016Updated 9 years ago
- tensorflow reinforcement learning agents for OpenAI gym environments☆139Jul 21, 2017Updated 8 years ago
- A parallel version of Trust Region Policy Optimization☆65Mar 6, 2017Updated 8 years ago
- ☆32Apr 27, 2017Updated 8 years ago
- C++ library to work with Iso8583 messages☆10Sep 22, 2018Updated 7 years ago
- Code for "CharManteau: Character Embedding Models For Portmanteau Creation. EMNLP 2017. Varun Gangal*, Harsh Jhamtani*, Graham Neubig, Ed…☆10Jun 20, 2019Updated 6 years ago
- Repository for Manning Twitch session about building and deploying APIs with Python☆12Jul 19, 2021Updated 4 years ago
- Tool for technical analysis of financial data about companies indexed on the stockmarket using machine learning☆11Sep 6, 2017Updated 8 years ago
- Addressing Training-Test Class Distribution Mismatch in Conversational Classification for SemEval-2019 Task3 EmoContext☆10Apr 9, 2019Updated 6 years ago
- Training MobileNet-v2 for object detection using the PVANet framework, paper: https://arxiv.org/abs/1611.08588☆13Feb 13, 2019Updated 7 years ago
- Proximal Asynchronous SAGA☆13Nov 30, 2017Updated 8 years ago
- TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016)☆215Feb 16, 2018Updated 8 years ago
- reimplementation of the ddpg algorithm using tensorflow☆38Oct 17, 2016Updated 9 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym☆361Jun 2, 2020Updated 5 years ago
- Cross-Platform Annotation Tool for Person Search Datasets☆11Aug 29, 2017Updated 8 years ago
- ☆11May 15, 2020Updated 5 years ago
- A Torch implementation of "Learning Temporal Transformations From Time-Lapse Video"☆10Sep 14, 2017Updated 8 years ago
- Simple Flask webservice to search through your PDF collection using Whoosh☆11Jul 11, 2014Updated 11 years ago
- Official sim2sim2real repo for TITA's Reinforcement Learning☆13Aug 15, 2025Updated 6 months ago
- Simple Typescript/Javascript framework for DOM manipulation☆15Aug 25, 2022Updated 3 years ago
- Orgmode-like folding for sideshow☆11Dec 14, 2018Updated 7 years ago
- DataLad extension for containerized environments☆11Nov 26, 2025Updated 2 months ago
- Super-Paramagnetic Clustering, Maximum entropy, Maximum Likelihood Methods.☆11Oct 18, 2021Updated 4 years ago
- List of papers about TTS☆10Dec 16, 2017Updated 8 years ago
- SimEc code relying on the theano library - check out the simec repo instead for keras based code!☆10Feb 28, 2018Updated 7 years ago