DavidMouse1118 / Reinforcement-Learning-Maze-WorldLinks
SARSA, Q-Learning, Expected SARSA, SARSA(λ) and Double Q-learning Implementation and Analysis
☆30Updated 5 years ago
Alternatives and similar repositories for Reinforcement-Learning-Maze-World
Users that are interested in Reinforcement-Learning-Maze-World are comparing it to the libraries listed below
Sorting:
- Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch.☆69Updated last week
- Collection of OpenAI parametrized action-space environments.☆65Updated 2 months ago
- I used this paper as inspiration https://arxiv.org/pdf/1904.03367.pdf☆34Updated 2 years ago
- The implementation of LSTM-TD3.☆81Updated 2 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆27Updated 5 years ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆103Updated 4 years ago
- The official code releasement of publications in MARL field of TJU RL lab.☆79Updated 2 years ago
- Minimal implementation of multi-agent reinforcement learning algorithms☆55Updated 3 years ago
- DSAC; Distributional Soft Actor-Critic☆127Updated 3 months ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆88Updated 4 years ago
- Evolution-based Soft Actor-Critic (ESAC)☆42Updated 10 months ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆51Updated 3 months ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆65Updated 8 months ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆22Updated 4 years ago
- Solution for Taxi env using HRL (Hierarchical reinforcement learning) (2018)☆21Updated 5 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆110Updated 4 years ago
- Implementation for mSAC methods in PyTorch☆41Updated 3 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆63Updated last year
- Deep recurrent Q learning on CartPole-v1 environment☆91Updated last year
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆49Updated 3 weeks ago
- Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x☆154Updated last year
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆141Updated 6 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆58Updated 5 years ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Updated 4 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆64Updated 5 years ago
- The state-of-the-art in multi-agent Reinforcement Learning is the MADDPG algorithm which utilises DDPG actor-critic neural networks where…☆27Updated 5 years ago
- Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment☆20Updated last year
- Distributed Heuristic Multi-Agent Path Finding with Communication - ICRA 2021☆70Updated 3 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆70Updated 3 years ago
- 2D Gridworld navigation using RL with Hindsight Experience Replay☆44Updated 6 years ago