Breakend / MultiStepBootstrappingInRL

Here, we compare Q(σ) learning presented by Sutton and Barto in [1] to Tree-Backup, n-step Expected Sarsa, and n-step Sarsa.

☆14

Alternatives and similar repositories for MultiStepBootstrappingInRL

Users who are interested in MultiStepBootstrappingInRL are comparing it to the repositories listed below.

jangirrishabh / toyCarIRL
Implementation of Inverse Reinforcement Learning Algorithm on a toy car in a 2D world problem, (Apprenticeship Learning via Inverse Reinf…
☆176 Updated 3 years ago
liampetti / DDPG
Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function,…
☆64 Updated 8 years ago
samlanka / DDPG-PyTorch
Deep Deterministic Policy Gradient implemented in PyTorch for DeepMind Control Suite
☆25 Updated 6 years ago
greydanus / baby-a3c
A high-performance Atari A3C agent in 180 lines of PyTorch
☆171 Updated 3 years ago
Damcy / prioritized-experience-replay
Implementation of prioritized experience replay
☆159 Updated 6 years ago
ikostrikov / pytorch-ddpg-naf
Implementation of algorithms for continuous control (DDPG and NAF).
☆309 Updated 4 years ago
camigord / DRL_papernotes
Notes and comments about Deep Reinforcement Learning papers
☆77 Updated 7 years ago
yrlu / reinforcement_learning
Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.
☆151 Updated 2 years ago
gabrielgarza / openai-gym-policy-gradient
Reinforcement Learning using Policy Gradient to solve OpenAI Gym games
☆113 Updated 7 years ago
Jiankai-Sun / Proximal-Policy-Optimization-Pytorch
Proximal Policy Optimization (PPO) algorithm and its distributed implementation in PyTorch
☆15 Updated 7 years ago
Anjum48 / rl-examples
Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow
☆103 Updated 4 years ago
stevenpjg / ddpg-aigym
Continuous control with deep reinforcement learning - Deep Deterministic Policy Gradient (DDPG) algorithm implemented in OpenAI Gym envir…
☆274 Updated 7 years ago
chingyaoc / pytorch-REINFORCE
PyTorch Implementation of REINFORCE for both discrete & continuous control
☆266 Updated 8 years ago
yilunc2020 / Attention-DQN
Deep Recurrent Attention Reinforcement Learning in Atari
☆84 Updated 7 years ago
alexis-jacq / Pytorch-DPPO
Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286
☆183 Updated 7 years ago
Kaixhin / ACER
Actor-critic with experience replay
☆254 Updated 2 years ago
hoangminhle / hierarchical_IL_RL
Code for hierarchical imitation learning and reinforcement learning
☆294 Updated 7 years ago
rgilman33 / simple-A2C-PPO
Actor-critic trained with PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch.
☆102 Updated 5 years ago
MG2033 / A2C
A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
☆181 Updated 6 years ago
takoika / PrioritizedExperienceReplay
Yet another prioritized experience replay buffer implementation.
☆48 Updated 2 years ago
ikostrikov / pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
☆441 Updated 6 years ago
anagabandi / nn_dynamics
☆344 Updated 7 years ago
navneet-nmk / pytorch-rl
This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
☆446 Updated 6 years ago
xuwd11 / cs294-112_hws
My solutions to the assignments in UC Berkeley CS294-112: Deep Reinforcement Learning
☆92 Updated 6 years ago
ahq1993 / inverse_rl
Adversarial Imitation Via Variational Inverse Reinforcement Learning
☆95 Updated 5 years ago
jingweiz / pytorch-distributed
Ape-X DQN & DDPG with pytorch & tensorboard
☆102 Updated 6 years ago
andrewliao11 / gail-tf
Tensorflow implementation of generative adversarial imitation learning
☆199 Updated 7 years ago
tianheyu927 / mil
Code for "One-Shot Visual Imitation Learning via Meta-Learning"
☆289 Updated 6 years ago
brendanator / atari-rl
Atari - Deep Reinforcement Learning algorithms in TensorFlow
☆138 Updated last year
Nasdin / ReinforcementLearning-AtariGame
PyTorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google DeepMind's Asynchronous Advanta…
☆187 Updated 10 months ago