AdamStelmaszczyk / rl-tutorial
Source code for "A deep dive into reinforcement learning"
☆12Updated 5 years ago
Alternatives and similar repositories for rl-tutorial:
Users that are interested in rl-tutorial are comparing it to the libraries listed below
- A2C, ACKTR and A2T implementations for ViZDoom☆10Updated 7 years ago
- Reinforcement learning algorithms to play Poker☆15Updated 3 years ago
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w…☆32Updated 6 years ago
- StarCraft II Learning Environment☆18Updated 6 years ago
- Reinforcement Learning and Deep Learning Resources☆16Updated 7 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆46Updated 6 years ago
- Scaling All-Goals Updates in Reinforcement Learning Using Convolutional Neural Networks☆41Updated 5 years ago
- ☆69Updated 6 years ago
- Framework for inspecting actions and observatinos in StarCraftII replays☆20Updated 6 years ago
- ☆29Updated 6 years ago
- Learning to play supermario using A3C algorithm☆11Updated 6 years ago
- A2C for GVG-AI☆21Updated 6 years ago
- Keras implementation of Curiosity-driven Exploration by Self-supervised Prediction☆8Updated 7 years ago
- [2019] (Neurips workshop paper) Blending behavioral cloning and RL☆9Updated 2 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆21Updated 4 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Updated 6 years ago
- A Policy Network in Tensorflow to classify chess moves☆17Updated 8 years ago
- Training Sonic with RLlib☆59Updated 2 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆16Updated 6 years ago
- Reimplementation code for the paper "Generative Temporal Models with Spatial Memory for Partially Observed Environments"☆29Updated 2 years ago
- Tensorflow implementation of the asynchronous advantage actor-critic (a3c) reinforcement learning algorithm for continuous action space☆46Updated 7 years ago
- General implementation of Advantage Actor Critic using Pytorch☆27Updated 3 years ago
- ☆18Updated 6 years ago
- Reinforcement Learning for Super Mario Bros using A3C on GPU☆37Updated 7 years ago
- A.I. learns how to drive with reinforcement learning☆22Updated 5 years ago
- Exploration Strategies for Deep Reinforcement Learning☆39Updated 6 years ago
- An implementation of the A3C deep reinforcement learning method using a LSTM layer. Created with Tensorflow.☆29Updated 7 years ago
- World Models applied to the Open AI Sonic Retro Contest☆77Updated 6 years ago
- ☆17Updated 7 years ago
- Simple grid-world environment compatible with OpenAI-gym☆50Updated 5 years ago