wojzaremba/trpo

☆99

Alternatives and similar repositories for trpo

Users that are interested in trpo are comparing it to the libraries listed below.

ilyasu123 / trpo
☆18 · Apr 25, 2016 · Updated 10 years ago
joschu / modular_rl
Implementation of TRPO and related algorithms
☆654 · May 20, 2018 · Updated 8 years ago
kvfrans / parallel-trpo
A parallel version of Trust Region Policy Optimization
☆65 · Mar 6, 2017 · Updated 9 years ago
tilarids / reinforcement_learning_playground
Playground for reinforcement learning algorithms implemented in TensorFlow
☆16 · Oct 18, 2016 · Updated 9 years ago
wojzaremba / trpo_rnn
☆20 · Apr 27, 2016 · Updated 10 years ago
DeNeutoy / act-rte-inference
☆15 · Sep 5, 2016 · Updated 9 years ago
jjkke88 / trpo
trust region policy optimization base on gym and tensorflow, can run in distribution mode
☆15 · May 6, 2017 · Updated 9 years ago
rlbayes / rllabplusplus
☆162 · Jul 21, 2017 · Updated 9 years ago
pat-coady / trpo
Trust Region Policy Optimization with TensorFlow and OpenAI Gym
☆364 · Jun 2, 2020 · Updated 6 years ago
steveKapturowski / tensorflow-rl
Implementations of deep RL papers and random experimentation
☆178 · Apr 7, 2018 · Updated 8 years ago
rmst / ddpg
TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016)
☆214 · Feb 16, 2018 · Updated 8 years ago
roosephu / needle
some RL algorithms
☆19 · Dec 9, 2016 · Updated 9 years ago
ikostrikov / pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
☆448 · Sep 13, 2018 · Updated 7 years ago
rll / rllab
rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.
☆3,071 · Jun 10, 2023 · Updated 3 years ago
openai / vime
Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"
☆347 · Nov 22, 2018 · Updated 7 years ago
openai / imitation
Code for the paper "Generative Adversarial Imitation Learning"
☆729 · Nov 22, 2018 · Updated 7 years ago
yjhong89 / TRPO-GAE
Trust Region Policy Optimization with Generalized Advantage Estimator
☆16 · Nov 15, 2018 · Updated 7 years ago
carpedm20 / NAF-tensorflow
"Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow
☆192 · Jul 20, 2018 · Updated 8 years ago
rarilurelo / pcl_keras
reinforcement learning. policy gradient. PCL
☆37 · Apr 25, 2017 · Updated 9 years ago
floringogianu / categorical-dqn
A working implementation of the Categorical DQN (Distributional RL).
☆95 · Apr 7, 2018 · Updated 8 years ago
openai / sonic-on-ray
Training Sonic with RLlib
☆61 · Apr 2, 2023 · Updated 3 years ago
hardmaru / pybullet_animations
pybullet_animations
☆12 · Nov 13, 2017 · Updated 8 years ago
Kaixhin / NoisyNet-A3C
Noisy Networks for Exploration
☆187 · Jan 28, 2018 · Updated 8 years ago
hi-abhi / tensorflow-value-iteration-networks
TensorFlow implementation of the Value Iteration Networks (NIPS '16) paper
☆549 · Mar 7, 2019 · Updated 7 years ago
cbfinn / gps
Guided Policy Search
☆600 · Feb 9, 2021 · Updated 5 years ago
miyosuda / episodic_control
Model-Free Episodic Control
☆14 · Jan 12, 2017 · Updated 9 years ago
dirkweissenborn / mufuru
Tensorflow Implementation of Multi-Function Recurrent Unit
☆23 · Jun 13, 2016 · Updated 10 years ago
ikostrikov / pytorch-ddpg-naf
Implementation of algorithms for continuous control (DDPG and NAF).
☆311 · Feb 16, 2021 · Updated 5 years ago
anokland / dfa-torch
Training neural networks with back-prop, feedback-alignment and direct feedback-alignment
☆105 · Jan 15, 2018 · Updated 8 years ago
andrewliao11 / pytorch-a3c-mujoco
Implement A3C for Mujoco gym envs
☆73 · Nov 2, 2017 · Updated 8 years ago
jiamings / fast-weights
Implementation of the paper [Using Fast Weights to Attend to the Recent Past](https://arxiv.org/abs/1610.06258)
☆173 · Nov 3, 2016 · Updated 9 years ago
rll / deeprlhw2
☆25 · Oct 22, 2015 · Updated 10 years ago
muupan / async-rl
Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)
☆408 · Feb 25, 2017 · Updated 9 years ago
miyosuda / unreal
Reinforcement learning with unsupervised auxiliary tasks
☆424 · Feb 13, 2019 · Updated 7 years ago
siemanko / a3c
Asynchronous Advantage Actor Critic
☆20 · Aug 15, 2016 · Updated 9 years ago
miyosuda / async_deep_reinforce
Asynchronous Methods for Deep Reinforcement Learning
☆588 · Aug 9, 2018 · Updated 7 years ago
iassael / torch-policy-gradient
Deterministic Policy Gradient using torch7
☆43 · Jun 2, 2016 · Updated 10 years ago
SeanNaren / QlearningExample.torch
Implementation of a simple example of Q learning in Torch.
☆51 · Mar 5, 2017 · Updated 9 years ago
aravindr93 / robustRL
Robust policy search algorithms which train on model ensembles
☆31 · Oct 26, 2016 · Updated 9 years ago