trust region policy optimization base on gym and tensorflow, can run in distribution mode
☆15May 6, 2017Updated 8 years ago
Alternatives and similar repositories for trpo
Users that are interested in trpo are comparing it to the libraries listed below
Sorting:
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆42Jan 27, 2018Updated 8 years ago
- ☆101Aug 15, 2016Updated 9 years ago
- Content Addressable Memory using dimensionality reduction☆13Apr 22, 2017Updated 8 years ago
- Recurrent Network-based Deterministic Policy Gradient for Solving Bipedal Walking Challenge on Rugged Terrains☆12Oct 16, 2017Updated 8 years ago
- Deep Reinforcement Learning☆17Sep 1, 2017Updated 8 years ago
- 感谢@greyireland的模板,自己用C++重写了一遍,多少复习了点数据结构和算法的知识,为之后刷题和求职做准备☆20Apr 21, 2021Updated 4 years ago
- Modeling uncertainty information in deep learning☆22Jan 11, 2018Updated 8 years ago
- ☆20Apr 27, 2016Updated 9 years ago
- Implementation of TRPO and related algorithms☆647May 20, 2018Updated 7 years ago
- ☆19Apr 25, 2016Updated 9 years ago
- Reference implementation for Structured Prediction with Deep Value Networks☆54Jul 10, 2017Updated 8 years ago
- A Siamese network implementation in torch (simple example on MNIST to embed to 2D space)☆23Aug 4, 2015Updated 10 years ago
- Data / annotations for video co-summarization (CVPR15)☆31Jan 3, 2017Updated 9 years ago
- Quadrotor simulator mainly purposed to train neural network to control quadrotor flight via deep q learning algorithm☆27Aug 5, 2022Updated 3 years ago
- Facial-Expression Recognition with Deep Neural Networks☆10Mar 6, 2016Updated 10 years ago
- tensorflow reinforcement learning agents for OpenAI gym environments☆139Jul 21, 2017Updated 8 years ago
- A parallel version of Trust Region Policy Optimization☆65Mar 6, 2017Updated 9 years ago
- ☆32Apr 27, 2017Updated 8 years ago
- Addressing Training-Test Class Distribution Mismatch in Conversational Classification for SemEval-2019 Task3 EmoContext☆10Apr 9, 2019Updated 6 years ago
- C++ library to work with Iso8583 messages☆10Sep 22, 2018Updated 7 years ago
- using pvanet framework train mobilenet-v2 for objects detection, papaer: https://arxiv.org/abs/1611.08588☆13Feb 13, 2019Updated 7 years ago
- Code for "CharManteau: Character Embedding Models For Portmanteau Creation. EMNLP 2017. Varun Gangal*, Harsh Jhamtani*, Graham Neubig, Ed…☆10Jun 20, 2019Updated 6 years ago
- Proximal Asynchronous SAGA☆13Nov 30, 2017Updated 8 years ago
- Repository for Manning Twitch session about building and deploying APIs with Python☆12Jul 19, 2021Updated 4 years ago
- TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016)☆215Feb 16, 2018Updated 8 years ago
- reimplementation of the ddpg algorithm using tensorflow☆38Oct 17, 2016Updated 9 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym☆361Jun 2, 2020Updated 5 years ago
- List of papers about TTS / Список статей о TTS☆10Dec 16, 2017Updated 8 years ago
- Basic start of TensorFlow with TensorBoard☆11Apr 9, 2016Updated 9 years ago
- Released code for the paper: Where To Look: Focus Regions for Visual Question Answering. (CVPR2016)☆10Apr 8, 2020Updated 5 years ago
- Orgmode-like folding for sideshow☆11Dec 14, 2018Updated 7 years ago
- BitmapScaler with different scaling algorhytms based on jxl-coder from awxkee☆11Jan 8, 2024Updated 2 years ago
- mobile part of the open SSI framework☆12Sep 5, 2018Updated 7 years ago
- a model zoo☆11Jul 19, 2017Updated 8 years ago
- Simple Typescript/Javascript framework for DOM manipulation☆15Aug 25, 2022Updated 3 years ago
- Question Dependent Recurrent Entity Network☆13Sep 21, 2017Updated 8 years ago
- Simple Flask webservice to search through your PDF collection using Whoosh☆11Jul 11, 2014Updated 11 years ago
- SqueezeDet implemented in CUDA&TensorRT☆12Nov 2, 2018Updated 7 years ago
- ☆10Oct 12, 2021Updated 4 years ago