DavidMouse1118 / Reinforcement-Learning-Maze-World
SARSA, Q-Learning, Expected SARSA, SARSA(λ) and Double Q-learning Implementation and Analysis
☆29Updated 6 years ago
Alternatives and similar repositories for Reinforcement-Learning-Maze-World
Users that are interested in Reinforcement-Learning-Maze-World are comparing it to the libraries listed below
- Collection of OpenAI parametrized action-space environments.☆66Updated 8 months ago
- Evolution-based Soft Actor-Critic (ESAC)☆42Updated last year
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆32Updated 2 years ago
- The implementation of LSTM-TD3.☆85Updated 2 years ago
- Experiments with DDPG/MADDPG/DQN/MADDPG+advantage on OpenAI's open-source MPE environment☆24Updated 7 years ago
- Heterogeneous Multi-Robot Reinforcement Learning☆57Updated 2 weeks ago
- Implementation for mSAC methods in PyTorch☆41Updated 4 years ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆72Updated last year
- I used this paper as inspiration https://arxiv.org/pdf/1904.03367.pdf☆35Updated 2 years ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆79Updated 5 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Updated 3 years ago
- The official code release for publications in the MARL field from the TJU RL lab.☆83Updated 3 years ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Updated 4 years ago
- ☆13Updated 5 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆96Updated last year
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆71Updated 2 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆54Updated 2 weeks ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆20Updated 3 years ago
- Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment☆21Updated 2 years ago
- Decision Transformer: A brand new Offline RL Pattern.☆37Updated 3 years ago
- Experiments with transformer based RL algorithms☆22Updated 6 years ago
- Adaptation of DQN, DDQN and COMA for multi-agent Gym environments☆12Updated 2 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Updated 4 years ago
- Experiments for performing empirical game-theoretic analysis of networked system control for common-pool resource management using multi-…☆18Updated 5 years ago
- Minimal implementation of multi-agent reinforcement learning algorithms☆59Updated 4 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆205Updated last year
- Implementation of the Hindsight Experience Replay paper in PyTorch☆31Updated 4 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆56Updated 2 years ago
- A clean and robust Pytorch implementation of TD3 on continuous action space☆30Updated last year
- Distributed Heuristic Multi-Agent Path Finding with Communication - ICRA 2021☆73Updated 4 years ago