cvhu / CliffWalkingLinks
Comparison between Sarsa and Q-Learning algorithms on risk handling
☆17Updated 8 years ago
Alternatives and similar repositories for CliffWalking
Users that are interested in CliffWalking are comparing it to the libraries listed below
Sorting:
- ☆56Updated 2 years ago
- Combining deep learning and reinforcement learning.☆81Updated 3 years ago
- [DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation☆53Updated 5 years ago
- I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u…☆51Updated 5 years ago
- Highly Modular and Scalable Reinforcement Learning☆118Updated 5 years ago
- ☆29Updated 7 years ago
- [ICML-18] Codes for the custom games we built to compare RL agents with humans☆66Updated 7 years ago
- Codes of our team for the OpenAI Retro Contest of reinforcement learning☆99Updated 7 years ago
- Reinforcement learning algorithm implementations and ML experimentation workspace☆43Updated 6 years ago
- Code for "Spinning Up a Pong AI With Deep RL" on FloydHub.☆55Updated 6 years ago
- Benchmark and build RL architectures that can do multitask and transfer learning.☆144Updated 2 years ago
- Direct Future Prediction (DFP ) in Keras☆109Updated 7 years ago
- Solved lab problems, slides and notes of the Deep Reinforcement Learning bootcamp 2017 held at UCBerkeley☆42Updated 7 years ago
- ☆24Updated 9 years ago
- Our NIPS 2017: Learning to Run source code☆55Updated 2 years ago
- ☆47Updated 7 years ago
- This package allows to use PLE as a gym environment.☆72Updated 5 years ago
- Full World Models Implementation in Chainer☆166Updated 7 years ago
- my public website☆12Updated last year
- OpenAI Retro Contest☆66Updated 2 years ago
- Pytorch Cheatsheet☆91Updated 7 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Updated 7 years ago
- reimplementation of the ddpg algorithm using tensorflow☆38Updated 8 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated 2 years ago
- Reason8.ai PyTorch solution for NIPS RL 2017 challenge☆84Updated 5 years ago
- A simple Gridworld environment for Open AI gym☆25Updated 7 years ago
- Web-based Reinforcement Learning Control Center☆65Updated 9 years ago
- Implementation of various Reinforcement Learning Algorithms☆27Updated 7 years ago
- Implementation of Deep/Double Deep/Dueling Deep Q networks for playing Atari games using Keras and OpenAI gym☆40Updated 7 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Updated 6 years ago