zeynepCankara / Cliff-Walking-Solution
Q-learning and SARSA algorithms from Sutton's Reinforcement Learning book.
☆18Updated 5 years ago
Alternatives and similar repositories for Cliff-Walking-Solution:
Users that are interested in Cliff-Walking-Solution are comparing it to the libraries listed below
- 2021年秋季南京大学 强化学习 课程作业☆8Updated 3 years ago
- The pytorch implementation of DGN on grid world and Starcraft☆137Updated 3 years ago
- DQN with pytorch with on Breakout and SpaceInvaders☆25Updated 5 years ago
- ☆159Updated last year
- ☆58Updated last year
- 此项目中将上传我在B站《强化学习理论基础》系列视频中的板书、参考资料等内容。☆72Updated 2 years ago
- ☆304Updated 2 years ago
- Unofficial Supplementary Materials for Reinforcement Learning Course at CUHK: textbooks, slides, related papers, assignment, code ...☆27Updated 4 years ago
- Implement DQN and DDQN algorithm on Atari games,such as BreakoutNoFrameskip-v4, PongNoFrameskip-v4,BoxingNoFrameskip-v4.☆15Updated 4 years ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms☆149Updated last year
- ☆123Updated 6 months ago
- [AAAI 2023 Oral] Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition☆30Updated 8 months ago
- Actor Critic model to play Cartpole game☆52Updated 6 years ago
- Half Field Offense in Robocup 2D Soccer with reinforcement learning☆34Updated 3 years ago
- A collection of offline reinforcement learning algorithms.☆170Updated 2 months ago
- ☆20Updated 4 years ago
- The implement of all kinds of dqn reinforcement learning with Pytorch☆93Updated 3 years ago
- pytorch实现的一些MARL算法☆65Updated 3 years ago
- Deep Q-Learning (DQN) implementation for Atari pong.☆77Updated 2 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆58Updated last year
- ☆193Updated last year
- A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges☆221Updated last week
- PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, H…☆69Updated 5 years ago
- Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆71Updated 2 months ago
- A plotter for reinforcement learning (RL)☆218Updated 3 years ago
- RLA is a tool for managing your RL experiments automatically☆71Updated 2 years ago
- A python module designed for agile RL algorithm developing.☆26Updated 7 months ago
- ☆90Updated 2 years ago
- PyTorch implementation of the implicit Q-learning algorithm (IQL)☆42Updated 3 years ago
- Code for Weighted QMIX☆129Updated 4 years ago