michaeltinsley / Gridworld-with-Q-Learning-Reinforcement-Learning-
Jupyter notebook containing a solution to Sutton and Barto's gridworld problem with both a random agent and a Q-learning agent.
☆28 Updated 6 years ago
Related projects
Alternatives and complementary repositories for Gridworld-with-Q-Learning-Reinforcement-Learning-
- ☆181 Updated 2 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,… ☆120 Updated 3 years ago
- Experiments with reinforcement learning and recurrent neural networks ☆113 Updated last year
- Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch. ☆66 Updated 3 months ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch. ☆73 Updated 4 years ago
- ☆41 Updated 5 years ago
- A collection of environments and reference agents for planning and reinforcement learning research in partially observable, multi-agent … ☆17 Updated last week
- ☆46 Updated 5 years ago
- Deep Reinforcement Learning Algorithms implemented with TensorFlow 2.3 ☆96 Updated 2 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG ☆63 Updated 5 years ago
- Adaptation of DQN, DDQN and COMA for multi-agent Gym environments ☆12 Updated last year
- Personal experiments on Reinforcement Learning ☆106 Updated 3 years ago
- ☆14 Updated last year
- Implementations of Deep Reinforcement Learning Algorithms and Benchmarking with PyTorch ☆115 Updated 4 years ago
- Minimalistic implementation of Vanilla Policy Gradient with PyTorch ☆18 Updated 5 years ago
- Deep recurrent Q-learning on the CartPole-v1 environment ☆74 Updated 10 months ago
- A Minimal Deep Q-Network ☆56 Updated 4 months ago
- Minimal implementation of multi-agent reinforcement learning algorithms ☆50 Updated 3 years ago
- ☆14 Updated 3 years ago
- A PyTorch implementation of the multi-agent deep deterministic policy gradients (MADDPG) algorithm ☆301 Updated 3 years ago
- A clean and robust PyTorch implementation of TD3 on continuous action space ☆23 Updated 5 months ago
- PyTorch implementation of Soft Actor-Critic (SAC). ☆98 Updated 4 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL… ☆271 Updated 3 years ago
- Collection of OpenAI parametrized action-space environments. ☆58 Updated last year
- Hands-on Reinforcement Learning with PyTorch, published by [Packt] ☆49 Updated 3 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients." ☆52 Updated 4 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery ☆96 Updated 2 years ago
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..… ☆110 Updated 7 months ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC ☆96 Updated 5 years ago
- Practical tutorial on RLlib for deep hierarchical multi-agent reinforcement learning ☆60 Updated 2 years ago