DavidMouse1118 / Reinforcement-Learning-Maze-World
SARSA, Q-Learning, Expected SARSA, SARSA(λ) and Double Q-learning Implementation and Analysis
☆29Updated 5 years ago
Alternatives and similar repositories for Reinforcement-Learning-Maze-World
Users who are interested in Reinforcement-Learning-Maze-World are comparing it to the libraries listed below
- Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment☆20Updated last year
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆77Updated 4 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆91Updated last year
- I used this paper as inspiration https://arxiv.org/pdf/1904.03367.pdf ☆35Updated 2 years ago
- The official code release for publications in the MARL field from the TJU RL lab.☆79Updated 3 years ago
- Basic reinforcement learning algorithms, including: DQN, Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic, DDPG, D…☆92Updated 4 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Updated 3 years ago
- The implementation of LSTM-TD3.☆81Updated 2 years ago
- Project exploring Multi Task Deep Reinforcement Learning neural network architectures and algorithms with Open AI Gym and TensorFlow☆17Updated 6 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. The implementation also includes the extensions Munchausen RL and D2R…☆23Updated 4 years ago
- Transformer in RL for decision-making☆96Updated 2 years ago
- Implementation of PPO Lagrangian in PyTorch☆49Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆180Updated last year
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆140Updated last year
- ☆41Updated 5 years ago
- An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer☆24Updated 5 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆72Updated 3 years ago
- Experiments with DDPG/MADDPG/DQN/MADDPG+advantage on OpenAI's open-source MPE environment☆24Updated 7 years ago
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆29Updated 2 years ago
- Experiments with reinforcement learning and recurrent neural networks☆114Updated last year
- 2D Gridworld navigation using RL with Hindsight Experience Replay☆43Updated 6 years ago
- DSAC; Distributional Soft Actor-Critic☆129Updated 5 months ago
- Value-Decomposition Networks For Cooperative Multi-Agent Learning☆23Updated 4 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆27Updated 5 years ago
- Implement reinforcement learning algorithms in Pytorch☆33Updated 4 years ago
- Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning☆56Updated 3 years ago
- Heterogeneous Multi-Robot Reinforcement Learning☆50Updated 10 months ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆68Updated 10 months ago
- [TNNLS] PGDQN: A generalized and efficient preference-guided epsilon-greedy policy equipped DQN for Atari and Autonomous Driving☆11Updated last year
- (ICML 2023) The official code for RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolut…☆34Updated last year