aaksham / frozenlake
Value & Policy Iteration for the frozenlake environment of OpenAI
☆15Updated 6 years ago
Alternatives and similar repositories for frozenlake
Users that are interested in frozenlake are comparing it to the libraries listed below
Sorting:
- TensorFlow A2C to solve Acrobot, with synchronized parallel environments☆35Updated 7 years ago
- A simple Gridworld environment for Open AI gym☆25Updated 6 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Updated 7 years ago
- Simple grid-world environment compatible with OpenAI-gym☆50Updated 5 years ago
- Tutorial on continuous control at Reinforcement Learning Summer School 2017.☆34Updated 7 years ago
- Combining deep learning and reinforcement learning.☆80Updated 3 years ago
- Web-based Reinforcement Learning Control Center☆64Updated 8 years ago
- Deep RL Bootcamp solutions☆35Updated 7 years ago
- Implementation of A3C Algorithm on Atari Games☆6Updated 8 years ago
- Direct Future Prediction (DFP ) in Keras☆109Updated 7 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆29Updated 7 years ago
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow☆103Updated 4 years ago
- ☆24Updated 9 years ago
- AI learning to walk in gym's BipedalWalker environment.☆66Updated 7 years ago
- Our NIPS 2017: Learning to Run source code☆55Updated 2 years ago
- An implementation of the Deep Deterministic Policy Gradient (DDPG) algorithm using Keras/Tensorflow with the robot simulated using ROS/Ga…☆61Updated 8 years ago
- random search, hill climbing, policy gradient☆143Updated 6 years ago
- Code from my blog post & online course☆54Updated 5 years ago
- PyTorch implementation of Proximal Policy Optimization☆51Updated 7 years ago
- ☆19Updated 6 years ago
- Exploration Strategies for Deep Reinforcement Learning☆39Updated 6 years ago
- reimplementation of the ddpg algorithm using tensorflow☆38Updated 8 years ago
- PyTorch implementation of Memory Augmented Self-Play☆50Updated 4 years ago
- Codebase of Santara et. al., RAIL: Risk Averse Imitation Learning, Published in AAMAS 2018☆14Updated 3 years ago
- [2019] (Neurips workshop paper) Blending behavioral cloning and RL☆9Updated 2 years ago
- These will be public notes for courses that I'm self-studying.☆26Updated 4 years ago
- [Reimplementation Ross et al 2011] An implementation of DAGGER using ConvNets for driving from pixels.☆77Updated 7 years ago
- Reason8.ai PyTorch solution for NIPS RL 2017 challenge☆84Updated 5 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments☆45Updated 4 years ago
- PPO Dash: Improving Generalization in Deep Reinforcement Learning☆16Updated 5 years ago