DavidMouse1118 / Reinforcement-Learning-Maze-World
SARSA, Q-Learning, Expected SARSA, SARSA(λ) and Double Q-learning Implementation and Analysis
☆29 Updated 5 years ago
Alternatives and similar repositories for Reinforcement-Learning-Maze-World:
Users who are interested in Reinforcement-Learning-Maze-World are comparing it to the libraries listed below:
- Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay. ☆92 Updated last year
- The implementation of LSTM-TD3. ☆72 Updated last year
- Implement many Sparse Reward algorithms in Gym Fetch environment ☆86 Updated 4 years ago
- Deep recurrent Q learning on CartPole-v1 environment ☆83 Updated last year
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC) ☆57 Updated last year
- PyTorch implementation of Soft Actor-Critic (SAC). ☆102 Updated 4 years ago
- 2D Gridworld navigation using RL with Hindsight Experience Replay ☆43 Updated 5 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆97 Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER) ☆47 Updated 4 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,… ☆120 Updated 4 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21) ☆79 Updated last year
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) ☆135 Updated 6 years ago
- Experiments with reinforcement learning and recurrent neural networks ☆113 Updated last year
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..… ☆114 Updated 8 months ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016 ☆121 Updated 5 months ago
- Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch. ☆67 Updated 5 months ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment ☆64 Updated 4 months ago
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games ☆26 Updated 2 years ago
- The official code release for publications in the MARL field from the TJU RL lab. ☆67 Updated 2 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient) ☆54 Updated 2 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆44 Updated 2 years ago
- Deep Reinforcement Learning using Proximal Policy Optimization and Random Network Distillation in TensorFlow 2 and PyTorch with some e… ☆50 Updated 4 years ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch. ☆76 Updated 4 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis ☆28 Updated 6 years ago
- Minimal implementation of multi-agent reinforcement learning algorithms ☆51 Updated 3 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆156 Updated 9 months ago
- Implementation of the Hindsight Experience Replay paper in PyTorch ☆27 Updated 3 years ago
- JAX and PZ RL envs + algorithms for swarms of CrazyFlies ☆69 Updated 4 months ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL. ☆140 Updated 3 years ago
- Transformer in RL for decision-making ☆82 Updated last year