ludobouan / Q-learning-gridworld
Reinforcement learning on gridworld with Q-learning
☆9Updated 8 years ago
Alternatives and similar repositories for Q-learning-gridworld:
Users that are interested in Q-learning-gridworld are comparing it to the libraries listed below
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Updated 6 years ago
- Simple grid-world environment compatible with OpenAI-gym☆49Updated 4 years ago
- ☆43Updated 7 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆48Updated 2 years ago
- Robust policy search algorithms which train on model ensembles☆28Updated 8 years ago
- Implementation of HER algorithm in the bit-flipping environment.☆17Updated 6 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆59Updated 6 years ago
- Random MDP experiments on true online TD from a forthcoming work by van Seijen et al. (2015)☆8Updated 8 years ago
- ☆35Updated 6 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 7 years ago
- Using Pilco algorithm to find a controller for few robotic problems☆43Updated 9 years ago
- rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.☆29Updated 6 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 4 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆23Updated 5 years ago
- ☆65Updated 10 months ago
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.☆66Updated 6 years ago
- Code for the paper "TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning", Artemij Amiranashvili, Ale…☆12Updated 6 years ago
- ☆8Updated 7 years ago
- Implementation of Deepmind's Neural Episodic Control☆58Updated 6 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆27Updated 5 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆31Updated 5 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Updated 6 years ago
- Exploration Strategies for Deep Reinforcement Learning☆39Updated 6 years ago
- Code accompanying the OptionGAN paper.☆43Updated 6 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆26Updated 5 years ago
- ☆98Updated 8 years ago
- This repository contains the code used in the paper Evaluating the Performance of Reinformcent Learning Algorithms☆27Updated 3 years ago
- Code implementation of: "Graying the black box: Understanding DQNs"☆20Updated 7 years ago
- A simple Gridworld environment for Open AI gym☆25Updated 6 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆36Updated 4 years ago