cvhu / CliffWalking

Comparison between Sarsa and Q-Learning algorithms on risk handling

☆17

Alternatives and similar repositories for CliffWalking

Users who are interested in CliffWalking are comparing it to the repositories listed below.

jeappen / gym-grid
A simple Gridworld environment for OpenAI Gym
☆25 Updated 7 years ago
AdamStelmaszczyk / learning2run
Our NIPS 2017: Learning to Run source code
☆55 Updated 2 years ago
alok / rl_implementations
☆43 Updated 6 years ago
alexis-jacq / LOLA_DiCE
PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
☆95 Updated 6 years ago
deepsense-ai / Distributed-BA3C
☆56 Updated 2 years ago
wassname / world-models-sonic-pytorch
Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w…
☆32 Updated 6 years ago
yidarvin / DREAM_DM_starter_code
Some starter code for training/testing some basic CNN models given our data.
☆10 Updated 8 years ago
arnomoonens / yarll
Combining deep learning and reinforcement learning.
☆80 Updated 3 years ago
aborghi / retro_contest_agent
☆29 Updated 7 years ago
toru34 / tf_optimizers
Optimizers in TensorFlow from scratch
☆18 Updated 8 years ago
Tsdevendra1 / NEAT-Algorithm
☆50 Updated 6 years ago
david-abel / rl_info_theory
A collection of code investigating the use of information theory for abstractions in RL
☆16 Updated 6 years ago
williamd4112 / awesome-deep-reinforcement-learning
A collection of resources about deep reinforcement learning
☆24 Updated 8 years ago
activatedgeek / torchrl
Highly Modular and Scalable Reinforcement Learning
☆115 Updated 5 years ago
RL-Group / tutorials-and-papers
Collection of tutorials, exercises and papers on RL
☆17 Updated 7 years ago
google-research / policy-learning-landscape
Explore the optimization landscape for direct policy learning in reinforcement learning.
☆50 Updated 6 years ago
rgilman33 / baselines-A2C
[DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation
☆53 Updated 5 years ago
Santara / RAIL
Codebase of Santara et al., RAIL: Risk Averse Imitation Learning, published in AAMAS 2018
☆15 Updated 3 years ago
Jianbo-Lab / deep-learning-project
☆8 Updated 8 years ago
dustinvtran / bayesrl
A Python library for reinforcement learning using Bayesian approaches
☆54 Updated 10 years ago
Nat-D / FeatureControlHRL
Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning
☆80 Updated 7 years ago
hardikbansal / Qlearning
☆39 Updated 7 years ago
AdeelMufti / DifferentiableNeuralComputer
Optimized Differentiable Neural Computer in Chainer
☆23 Updated 6 years ago
pmlg / deep-rl-bootcamp
Deep RL Bootcamp solutions
☆35 Updated 7 years ago
ASzot / imagination-augmented-agents-tf
Imagination Augmented Agents TensorFlow
☆26 Updated 5 years ago
rll / deeprlhw2
☆24 Updated 9 years ago
openai / sonic-on-ray
Training Sonic with RLlib
☆59 Updated 2 years ago
brendenlake / AAI-site
NYU PSYCH-GA 3405.001 / DS-GA 3001.014: Advancing AI through cognitive science
☆132 Updated 6 years ago
aitorzip / deepbootcamp
Solved lab problems, slides, and notes from the Deep Reinforcement Learning Bootcamp 2017 held at UC Berkeley
☆42 Updated 7 years ago
avdmitry / rl_3d
Reinforcement learning in 3D.
☆21 Updated 8 years ago