aaksham / frozenlake
Value and policy iteration for the FrozenLake environment from OpenAI Gym
☆15 · Updated 5 years ago
Related projects
Alternatives and complementary repositories for frozenlake
- A simple Gridworld environment for OpenAI Gym ☆24 · Updated 6 years ago
- Simple grid-world environment compatible with OpenAI Gym ☆49 · Updated 4 years ago
- Hindsight Experience Replay - Bit flipping experiment in TensorFlow ☆58 · Updated 6 years ago
- Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration ☆25 · Updated 5 years ago
- Implementation of Deep/Double Deep/Dueling Deep Q networks for playing Atari games using Keras and OpenAI Gym ☆40 · Updated 6 years ago
- Reinforcement learning on gridworld with Q-learning ☆9 · Updated 7 years ago
- ☆35 · Updated 6 years ago
- This repository contains the game bubble shooter as a gym environment. Based on: https://github.com/justinmeister/bubbleshooter ☆17 · Updated 4 years ago
- Code from my blog post & online course ☆54 · Updated 5 years ago
- [2019] (NeurIPS workshop paper) Blending behavioral cloning and RL ☆9 · Updated last year
- TensorFlow implementation of A3C algorithm ☆48 · Updated 7 years ago
- TensorFlow A2C to solve Acrobot, with synchronized parallel environments ☆35 · Updated 6 years ago
- Notes and comments about Deep Reinforcement Learning papers ☆76 · Updated 6 years ago
- A collection of multi-agent reinforcement learning OpenAI Gym environments ☆44 · Updated 4 years ago
- PPO Dash: Improving Generalization in Deep Reinforcement Learning ☆16 · Updated 5 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning ☆27 · Updated 5 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL ☆86 · Updated 5 years ago
- General implementation of Advantage Actor Critic using PyTorch ☆26 · Updated 2 years ago
- ☆18 · Updated 5 years ago
- Our NIPS 2017: Learning to Run source code ☆56 · Updated last year
- A 150-line Python implementation of Augmented Random Search (https://arxiv.org/abs/1803.07055) with NumPy. ☆70 · Updated 5 years ago
- Shared autonomy via deep reinforcement learning ☆74 · Updated last year
- Value iteration, policy iteration, and Q-Learning in a grid-world MDP. ☆26 · Updated 11 months ago
- Codebase of Santara et al., RAIL: Risk Averse Imitation Learning, published in AAMAS 2018 ☆14 · Updated 2 years ago
- ☆27 · Updated 3 years ago
- Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras ☆32 · Updated 8 years ago
- Research and implementations of Deep RL agents and their applications ☆47 · Updated 3 weeks ago
- TensorFlow implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning". ☆25 · Updated 7 years ago
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment ☆48 · Updated 6 years ago