kevin-hanselman / grid-world-rl
Value iteration, policy iteration, and Q-Learning in a grid-world MDP.
☆29 Updated last year
Alternatives and similar repositories for grid-world-rl
Users who are interested in grid-world-rl compare it to the libraries listed below.
- Reinforcement learning algorithms with Generalized Advantage Estimation ☆21 Updated 7 years ago
- Old and new Reinforcement Learning algorithms run on the GridUniverse ecosystem ☆23 Updated 6 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments ☆45 Updated 5 years ago
- Safe Reinforcement Learning algorithms ☆74 Updated 2 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm ☆44 Updated 6 years ago
- Modified CartPole-v0 OpenAI Gym environment with various noisy cases and Reinforcement Learning based controller ☆9 Updated 7 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019. ☆56 Updated 6 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm ☆43 Updated 2 years ago
- 🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation ☆38 Updated 3 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations ☆49 Updated 3 years ago
- RL CIRL Research ☆13 Updated 2 years ago
- A simple Gridworld environment for Open AI gym ☆25 Updated 7 years ago
- Solving openAI's game 'BipedalWalker-v2' with Deep Reinforcement Learning ☆26 Updated 5 years ago
- A repository for code of reinforcement learning algorithms with PyTorch ☆30 Updated 3 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning ☆28 Updated 6 years ago
- Notes and comments about Deep Reinforcement Learning papers ☆77 Updated 7 years ago
- Applying minimaxQ learning algorithm to 2 agents games ☆33 Updated 7 years ago
- Deep Reinforcement Learning Framework done with PyTorch ☆36 Updated 3 months ago
- Simple grid-world environment compatible with OpenAI-gym ☆50 Updated 5 years ago
- Bayesian Inverse Reinforcement Learning with simple environments ☆20 Updated 3 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow ☆58 Updated 6 years ago
- ☆54 Updated 7 years ago
- TD-Regularized Actor-Critic Methods ☆36 Updated 5 years ago
- Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr… ☆37 Updated 6 years ago
- research and implementations of Deep RL agents and their applications ☆51 Updated 3 weeks ago
- Proximal policy optimization in PyTorch. Easy to read and understand. ☆50 Updated 4 years ago
- hierarchical deep reinforcement learning algorithms ☆41 Updated 7 years ago
- Experiment code for testing effect of various action space transformations in reinforcement learning ☆30 Updated 5 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards ☆28 Updated 6 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874 ☆47 Updated 4 years ago