DavidMouse1118 / Reinforcement-Learning-Maze-WorldLinks

SARSA, Q-Learning, Expected SARSA, SARSA(λ) and Double Q-learning Implementation and Analysis

☆29

Alternatives and similar repositories for Reinforcement-Learning-Maze-World

Users that are interested in Reinforcement-Learning-Maze-World are comparing it to the libraries listed below

Sorting:

felix-kerkhoff / DQfD
An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games
☆29Updated 2 years ago
karush17 / esac
Evolution-based Soft Actor-Critic (ESAC)
☆42Updated last year
keep9oing / DRQN-Pytorch-CartPole-v1
Deep recurrent Q learning on CartPole-v1 environment
☆91Updated last year
BY571 / Deep-Reinforcement-Learning-Algorithm-Collection
Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.
☆78Updated 4 years ago
thomashirtz / gym-hybrid
Collection of OpenAI parametrized action-space environments.
☆65Updated 4 months ago
LxzGordon / Deep-Reinforcement-Learning-with-pytorch
Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…
☆91Updated 4 years ago
AntoineTheb / RNN-RL
Experiments with reinforcement learning and recurrent neural networks
☆114Updated last year
BY571 / D4PG
PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…
☆23Updated 4 years ago
ImmanuelXIV / ppo-self-play
Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment
☆20Updated 2 years ago
puyuan1996 / MARL
Implementation for mSAC methods in PyTorch
☆42Updated 3 years ago
williamyuanv0 / Transformer-in-Reinforcement-Learning-for-Decision-Making-A-Survey
Transformer in RL for decision-making
☆98Updated 2 years ago
chauncygu / Multi-Agent-Constrained-Policy-Optimisation
Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).
☆181Updated last year
toshikwa / soft-actor-critic.pytorch
PyTorch implementation of Soft Actor-Critic(SAC).
☆103Updated 5 years ago
RvuvuzelaM / self-attention-ppo-pytorch
I used this paper as inspiration https://arxiv.org/pdf/1904.03367.pdf
☆35Updated 2 years ago
TJU-DRL-LAB / Multiagent-RL
The official code releasement of publications in MARL field of TJU RL lab.
☆79Updated 3 years ago
seolhokim / DistributedRL-Pytorch-Ray
Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)
☆27Updated 3 years ago
antonai91 / reinforcement_learning
☆15Updated 4 years ago
uoe-agents / robotic-warehouse
Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment
☆70Updated 10 months ago
HzcIrving / DecisionTransformer_StepbyStep
Decision Transformer: A brand new Offline RL Pattern.
☆36Updated 3 years ago
jianzhnie / deep-marl-toolkit
MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…
☆141Updated last year
seungju-k1m / sac-td3-td7
pytorch implementation of SAC, TD3 and TD7 with Mujoco Benchmark results from 4 seeds.
☆12Updated last year
wisnunugroho21 / reinforcement_learning_ppo_rnd
Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…
☆53Updated 4 years ago
yashbonde / Transformer-RL
Experiments to train transformer network to master reinforcement learning environments.
☆32Updated 4 years ago
Howuhh / prioritized_experience_replay
Prioritized Experience Replay implementation with proportional prioritization
☆81Updated 2 years ago
Bigpig4396 / PyTorch-Deep-Recurrent-Q-Learning-DRQN
☆42Updated 6 years ago
catezi / MAPT
This is the official code repository for the paper "Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Ag…
☆10Updated 6 months ago
LinghengMeng / LSTM-TD3
The implementation of LSTM-TD3.
☆82Updated 2 years ago
proroklab / HetGPPO
Heterogeneous Multi-Robot Reinforcement Learning
☆51Updated 10 months ago
kevslinger / DTQN
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
☆165Updated last year
david1309 / Multi_Task_RL
Project exploring Multi Task Deep Reinforcement Learning neural network architectures and algorithms with Open AI Gym and TensorFlow
☆17Updated 6 years ago