cvhu / CliffWalking
Comparison between Sarsa and Q-Learning algorithms on risk handling
☆17 Updated 7 years ago
Alternatives and similar repositories for CliffWalking:
Users interested in CliffWalking compare it to the repositories listed below.
- ☆56 Updated 2 years ago
- Combining deep learning and reinforcement learning. ☆80 Updated 3 years ago
- Optimized Differentiable Neural Computer in Chainer ☆23 Updated 6 years ago
- Collection of tutorials, exercises and papers on RL ☆17 Updated 7 years ago
- Our NIPS 2017: Learning to Run source code ☆55 Updated 2 years ago
- ☆39 Updated 7 years ago
- Deep Reinforcement Learning with Fine Grained Action Repetition ☆23 Updated 7 years ago
- Trained models for keras-rl. ☆21 Updated 8 years ago
- Spinal cord gray matter segmentation using deep dilated convolutions. ☆45 Updated 7 years ago
- ☆43 Updated 5 years ago
- Highly Modular and Scalable Reinforcement Learning ☆113 Updated 5 years ago
- Curated materials for different machine learning related summer schools ☆19 Updated 4 years ago
- PyTorch implementation of DeepMind's 'Hybrid computing using a neural network with dynamic external memory' (Differentiable Neural Comput… ☆19 Updated 7 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning. ☆50 Updated 6 years ago
- Imagination Augmented Agents TensorFlow ☆26 Updated 5 years ago
- A simple Gridworld environment for OpenAI Gym ☆25 Updated 6 years ago
- Some starter code for training/testing some basic CNN models given our data. ☆10 Updated 8 years ago
- ☆8 Updated 8 years ago
- Distributed TensorFlow implementation of Asynchronous Methods for Deep Reinforcement Learning ☆29 Updated 7 years ago
- ☆22 Updated 6 years ago
- Source code for ICLR 2020 paper: "Learning to Guide Random Search" ☆39 Updated 7 months ago
- A Python library for reinforcement learning using Bayesian approaches ☆54 Updated 9 years ago
- Optimizers in TensorFlow from scratch ☆18 Updated 7 years ago
- Some code for tutorials following https://gym.openai.com/docs/rl ☆14 Updated 8 years ago
- Implementation of counterfactual risk minimization ☆26 Updated 7 years ago
- ☆50 Updated 5 years ago
- Simple change of A3C to A2C ☆15 Updated 7 years ago
- Code for ICLR 2019 paper "Learning Dynamics Model by Incorporating the Long Term Future" ☆50 Updated 5 years ago
- Various experiments on the [Fashion-MNIST](https://github.com/zalandoresearch/fashion-mnist) dataset from Zalando ☆31 Updated 7 years ago
- ☆20 Updated last year