DavidMouse1118 / Reinforcement-Learning-Maze-World
SARSA, Q-Learning, Expected SARSA, SARSA(λ) and Double Q-learning Implementation and Analysis
☆28Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for Reinforcement-Learning-Maze-World
- Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.☆89Updated last year
- Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch.☆66Updated 3 months ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆110Updated 7 months ago
- An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer☆23Updated 5 years ago
- A collection of pre-trained RL agents using Stable Baselines3☆105Updated 2 weeks ago
- ☆41Updated 5 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆45Updated 4 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆146Updated 7 months ago
- The implementation of LSTM-TD3.☆64Updated last year
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆18Updated 3 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆53Updated 4 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆161Updated 2 months ago
- Deep recurrent Q learning on CartPole-v1 environment☆74Updated 10 months ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆58Updated 2 months ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆95Updated 3 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆120Updated 3 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆54Updated last year
- Collection of OpenAI parametrized action-space environments.☆58Updated last year
- Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multiagent Unity environment.☆31Updated 3 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆82Updated 4 years ago
- Minimal implementation of multi-agent reinforcement learning algorithms☆50Updated 3 years ago
- Implementation of HindSight Experience Replay paper with Pytorch☆25Updated 3 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆26Updated 3 years ago
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆92Updated 2 years ago
- UAV Logistics Environment for Multi-Agent Reinforcement Learning / Unity ML-Agents / Unity 3D☆81Updated 7 months ago
- Deep Reinforcement Learning for Continuous Control in PyTorch☆93Updated 2 years ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆140Updated 3 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆50Updated 3 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆43Updated 3 years ago
- 2D Gridworld navigation using RL with Hindsight Experience Replay☆43Updated 5 years ago