DavidMouse1118 / Reinforcement-Learning-Maze-WorldLinks
SARSA, Q-Learning, Expected SARSA, SARSA(λ) and Double Q-learning Implementation and Analysis
☆29Updated 6 years ago
Alternatives and similar repositories for Reinforcement-Learning-Maze-World
Users that are interested in Reinforcement-Learning-Maze-World are comparing it to the libraries listed below
Sorting:
- Project exploring Multi Task Deep Reinforcement Learning neural network architectures and algorithms with Open AI Gym and TensorFlow☆17Updated 7 years ago
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆32Updated 2 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆91Updated last year
- Evolution-based Soft Actor-Critic (ESAC)☆42Updated last year
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆78Updated 4 years ago
- Adaptation of DQN, DDQN and COMA for multi-agent Gym environments☆12Updated last year
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆23Updated 4 years ago
- I used this paper as inspiration https://arxiv.org/pdf/1904.03367.pdf☆35Updated 2 years ago
- OpenAI Gym environment designed for training RL agents to control the flight of a two-dimensional drone.☆55Updated 3 years ago
- ☆15Updated 4 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆94Updated 4 years ago
- A clean and robust Pytorch implementation of TD3 on continuous action space☆29Updated last year
- Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.☆28Updated 4 months ago
- 2D Gridworld navigation using RL with Hindsight Experience Replay☆44Updated 6 years ago
- Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.☆101Updated 4 months ago
- DQN examples codes in chapter 4☆43Updated 2 years ago
- The implementation of LSTM-TD3.☆85Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆189Updated last year
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆65Updated 6 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆53Updated 4 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆163Updated last year
- Generate expert demonstrations; GAIL(Generative Adversarial Imitation Learning); IRL(Inverse Reinforcement Learning)☆32Updated 4 years ago
- Reinforcement learning algorithms A2C, A3C and DQN☆16Updated last year
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆121Updated 4 years ago
- The state-of-the-art in multi-agent Reinforcement Learning is the MADDPG algorithm which utilises DDPG actor-critic neural networks where…☆27Updated 5 years ago
- Heterogeneous Multi-Robot Reinforcement Learning☆52Updated last year
- OpenAI MountainCar-v0 DeepRL-based solutions (DQN, DuelingDQN, D3QN)☆25Updated 4 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆55Updated 2 years ago
- 用DDPG/MADDPG/DQN/MADDPG+advantage实验 OpenAI开源的MPE环境☆24Updated 7 years ago
- meta-MADDPG (Python implementation)☆18Updated 7 years ago