DavidMouse1118 / Reinforcement-Learning-Maze-WorldLinks
SARSA, Q-Learning, Expected SARSA, SARSA(λ) and Double Q-learning Implementation and Analysis
☆29Updated 6 years ago
Alternatives and similar repositories for Reinforcement-Learning-Maze-World
Users that are interested in Reinforcement-Learning-Maze-World are comparing it to the libraries listed below
Sorting:
- Evolution-based Soft Actor-Critic (ESAC)☆42Updated last year
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Updated 4 years ago
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆32Updated 2 years ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆80Updated 5 years ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Updated 4 years ago
- Project exploring Multi Task Deep Reinforcement Learning neural network architectures and algorithms with Open AI Gym and TensorFlow☆17Updated 7 years ago
- Collection of OpenAI parametrized action-space environments.☆66Updated 7 months ago
- Deep recurrent Q learning on CartPole-v1 environment☆94Updated last year
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆53Updated 5 months ago
- Experiments with reinforcement learning and recurrent neural networks☆115Updated 2 years ago
- ☆16Updated 4 years ago
- Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch.☆69Updated 5 months ago
- Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multiagent Unity environment.☆38Updated 4 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆121Updated 4 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆146Updated 6 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Updated 3 years ago
- Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.☆100Updated 5 months ago
- This is a repository of DQN and its variants implementation in PyTorch based on the original papar.☆13Updated 5 years ago
- I used this paper as inspiration https://arxiv.org/pdf/1904.03367.pdf☆35Updated 2 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆94Updated 4 years ago
- 2D Gridworld navigation using RL with Hindsight Experience Replay☆44Updated 6 years ago
- Prioritized Experience Replay implementation with proportional prioritization☆84Updated 2 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆53Updated 4 years ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆148Updated 3 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆201Updated last year
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆41Updated 5 years ago
- ☆13Updated 5 years ago
- Heterogeneous Multi-Robot Reinforcement Learning☆55Updated last year
- Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.☆29Updated 5 months ago
- Solving POMDP using Recurrent networks☆91Updated 5 years ago