☆18Apr 25, 2016Updated 10 years ago
Alternatives and similar repositories for trpo
Users that are interested in trpo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A parallel version of Trust Region Policy Optimization☆65Mar 6, 2017Updated 9 years ago
- ☆99Aug 15, 2016Updated 9 years ago
- ☆15Sep 5, 2016Updated 9 years ago
- ☆18Mar 5, 2017Updated 9 years ago
- ☆20Apr 27, 2016Updated 10 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of TRPO and related algorithms☆651May 20, 2018Updated 8 years ago
- Learning to Reinforcement Learn☆11Nov 22, 2022Updated 3 years ago
- Tensorflow Implementation of Multi-Function Recurrent Unit☆23Jun 13, 2016Updated 9 years ago
- Wikipedia navigation environment for OpenAI Gym☆41Apr 2, 2023Updated 3 years ago
- Model-Free Episodic Control☆14Jan 12, 2017Updated 9 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆95Apr 7, 2018Updated 8 years ago
- ☆38Mar 6, 2017Updated 9 years ago
- Understanding Short-Horizon Bias in Stochastic Meta-Optimization☆37Mar 8, 2018Updated 8 years ago
- pybullet_animations☆12Nov 13, 2017Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- trust region policy optimization base on gym and tensorflow, can run in distribution mode☆15May 6, 2017Updated 9 years ago
- ☆25Oct 22, 2015Updated 10 years ago
- RWA in pytorch☆14May 7, 2017Updated 9 years ago
- ☆28Apr 15, 2017Updated 9 years ago
- Design good curriculums for deep reinforcement learning☆14May 18, 2016Updated 10 years ago
- reinforcement learning. policy gradient. PCL☆37Apr 25, 2017Updated 9 years ago
- Playground for reinforcement learning algorithms implemented in TensorFlow☆16Oct 18, 2016Updated 9 years ago
- Repo for code for the NIPS paper entitled "An Architecture for Deep, Hierarchical Generative Models"☆14Oct 27, 2016Updated 9 years ago
- imperative programming in TensorFlow☆18Dec 12, 2016Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆36Aug 2, 2016Updated 9 years ago
- Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"☆347Nov 22, 2018Updated 7 years ago
- Keras implementation of guide actor-critic for continuous control☆11Mar 12, 2018Updated 8 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆36Apr 29, 2026Updated 3 weeks ago
- Deep reinforcement learning using an asynchronous advantage actor-critic (A3C) model.☆64Mar 10, 2018Updated 8 years ago
- Learning to Avoid Errors in GANs by Input Space Manipulation (Code for paper)☆23Jul 7, 2017Updated 8 years ago
- Add-on package to gym, to record sequences of actions, observations, and rewards☆76Apr 2, 2023Updated 3 years ago
- Code for R:SS 2021 paper RMP2: A Structured Composable Policy Class for Robot Learning.☆44Jun 26, 2021Updated 4 years ago
- Incorporates external dependencies into HTML file using data: URI scheme☆21Nov 17, 2011Updated 14 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Tensorflow Implementation of the (Dual)-Associative Memory GRUs☆18Jun 14, 2016Updated 9 years ago
- Some Reinforcement Learning in Python☆115Apr 17, 2017Updated 9 years ago
- ☆28Apr 28, 2019Updated 7 years ago
- Towards cross-lingual distributed representations without parallel text trained with adversarial autoencoders☆22Aug 11, 2016Updated 9 years ago
- Using Luxor.jl to design common diagrams found in Category Theory 🐱☆13Mar 26, 2022Updated 4 years ago
- TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016)☆214Feb 16, 2018Updated 8 years ago
- Add-on for OpenAI Gym that supports automatic downloading of user environments.☆45May 20, 2017Updated 9 years ago