hysts / RL_refs

☆11

Alternatives and similar repositories for RL_refs

Users who are interested in RL_refs are comparing it to the repositories listed below.

ermongroup / CalibratedModelBasedRL
Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.
☆56 Updated 6 years ago
google-research / policy-learning-landscape
Explore the optimization landscape for direct policy learning reinforcement learning.
☆50 Updated 6 years ago
ArmaanSethi / Hindsight-Experience-Replay-and-Hierarchical-Reinforcement-Learning
Comp 781 Project
☆9 Updated 6 years ago
cathywu / rllab-multiagent
☆11 Updated 2 years ago
Santara / stochastic_value_gradient
Implementation of "Learning Continuous Control Policies by Stochastic Value Gradients" (https://arxiv.org/abs/1510.09142)
☆26 Updated 3 years ago
nanxintin / deep-reinforcement-learning
Reinforcement Learning and Deep Learning Resources
☆16 Updated 7 years ago
kschweig / OfflineRL
Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning
☆25 Updated 2 years ago
zuoxingdong / DeepPILCO
☆54 Updated 7 years ago
martinseilair / learningoptimalcontrol
Great resources for learning optimal control
☆18 Updated 6 years ago
YyzHarry / SV-RL
[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning
☆34 Updated 5 years ago
jvmncs / ParamNoise
A comparison of parameter space noise methods for exploration in deep reinforcement learning
☆28 Updated 6 years ago
seungjaeryanlee / rl-exploration
Reinforcement Learning papers on exploration methods.
☆19 Updated 4 years ago
philipjball / SAC_PyTorch
🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation
☆38 Updated 3 years ago
bryonkucharski / Learning-to-Drive-with-Reinforcement-Learning-and-Variational-Autoencoders
Project for my graduate neural networks course - combining RL with VAEs
☆23 Updated 5 years ago
shamanez / VUSFA-Variational-Universal-Successor-Features-Approximator
This repository contains implementations of the paper VUSFA
☆14 Updated 4 years ago
HumanCompatibleAI / population-irl
(Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards
☆28 Updated 6 years ago
schroederdewitt / mackrl
Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)
☆33 Updated 5 years ago
akshaykhadse / reinforcement-learning
Implementations of basic concepts covered under the Reinforcement Learning umbrella. This project is a collection of assignments in CS747: F…
☆17 Updated 7 years ago
shaktikshri / adaptiveSystems
RL CIRL Research
☆13 Updated 2 years ago
zsdonghao / Imitation-Learning-Dagger-Torcs
A Simple Example for Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env
☆71 Updated 7 years ago
resibots / kaushik_2018_multi-dex
Source code for "Multi-objective Model-based Policy Search for Data-efficient Learning with Sparse Rewards" (CoRL 2018)
☆13 Updated 6 years ago
BY571 / Randomized-Ensembled-Double-Q-learning-REDQ-
PyTorch implementation of Randomized Ensembled Double Q-learning (REDQ)
☆21 Updated 4 years ago
KMarino / hrl-ep3
Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies
☆15 Updated 6 years ago
AdaCompNUS / qmdp-net
QMDP-Net implementation
☆65 Updated 5 years ago
sparisi / td-reg
TD-Regularized Actor-Critic Methods
☆36 Updated 5 years ago
kindredresearch / arp
Autoregressive policies for continuous control reinforcement learning
☆32 Updated 6 years ago
Jeremy26 / bipedal-walker
Train a Bipedal Robot to walk using Reinforcement Learning
☆9 Updated 6 years ago
kevin-hanselman / grid-world-rl
Value iteration, policy iteration, and Q-Learning in a grid-world MDP.
☆29 Updated last year
tianjunz / MADE
☆19 Updated 3 years ago
dnishio / DSAC
The implementation of Discriminator Soft Actor Critic
☆15 Updated 5 years ago