DavidMouse1118 / Reinforcement-Learning-Maze-World
SARSA, Q-Learning, Expected SARSA, SARSA(λ) and Double Q-learning Implementation and Analysis
☆30Updated 5 years ago
Alternatives and similar repositories for Reinforcement-Learning-Maze-World:
Users who are interested in Reinforcement-Learning-Maze-World are comparing it to the repositories listed below:
- The implementation of LSTM-TD3.☆79Updated 2 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆87Updated last year
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆50Updated last month
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆166Updated last year
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆125Updated 11 months ago
- The official code release for publications in the MARL field from the TJU RL lab.☆72Updated 2 years ago
- An implementation of the policy gradient RL algorithm in PyTorch☆38Updated 4 years ago
- Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch.☆68Updated 8 months ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆65Updated 7 months ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆78Updated 4 years ago
- Code for implementing/applying ODM*, PPO, MAAC, IC3Net and PRIMAL (PPO version) on a Multi-Agent gridworld environment.☆34Updated 4 years ago
- Implementation for mSAC methods in PyTorch☆41Updated 3 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆51Updated 4 years ago
- I used this paper as inspiration https://arxiv.org/pdf/1904.03367.pdf☆32Updated 2 years ago
- Collection of OpenAI parametrized action-space environments.☆64Updated 3 weeks ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆65Updated 3 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. The implementation also includes the extensions Munchausen RL and D2R…☆22Updated 4 years ago
- Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multiagent Unity environment.☆35Updated 4 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Updated 2 years ago
- Experiments with DDPG/MADDPG/DQN/MADDPG+advantage on OpenAI's open-source MPE environment☆23Updated 6 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆126Updated 8 months ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆106Updated 3 years ago
- ☆40Updated 3 years ago
- 2D Gridworld navigation using RL with Hindsight Experience Replay☆45Updated 5 years ago
- Distributed Heuristic Multi-Agent Path Finding with Communication - ICRA 2021☆68Updated 3 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆33Updated 3 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆53Updated last year
- ☆41Updated 5 years ago
- Multi-agent project (CommNet, BiCNet, MADDPG) in PyTorch for the Multi-Agent Particle Environment☆117Updated 2 years ago
- An implementation of ATOC☆14Updated 3 years ago