zeynepCankara / Cliff-Walking-Solution
Q-learning and SARSA algorithms from Sutton's Reinforcement Learning book.
☆17Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for Cliff-Walking-Solution
- Unofficial Supplementary Materials for Reinforcement Learning Course at CUHK: textbooks, slides, related papers, assignment, code ...☆26Updated 4 years ago
- ☆20Updated 4 years ago
- Code for Adapting Environment Sudden Changes by Learning Context Sensitive Policy☆20Updated 2 years ago
- 此项目中将上传我在B站《强化学习理论基础》系列视频中的板书、参考资料等内容。☆72Updated last year
- ☆17Updated 2 years ago
- Implement PPO algorithm on mujoco environment,such as Ant-v2, Humanoid-v2, Hopper-v2, Halfcheeth-v2.☆50Updated 4 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆54Updated last year
- discrete soft Q learning(SQL) and soft Q imitation learning(SQIL) implementation in pytorch, simple!☆52Updated 2 years ago
- 2021年秋季南京大学 强化学习 课程作业☆10Updated 2 years ago
- RLA is a tool for managing your RL experiments automatically☆70Updated last year
- ☆26Updated 4 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆26Updated 2 years ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆27Updated 3 years ago
- Hello😜☆30Updated 4 years ago
- Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆69Updated last year
- Pytorch GAIL VAIL AIRL VAIRL EAIRL SQIL Implementation☆63Updated 3 years ago
- ☆121Updated 3 years ago
- This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer"…☆21Updated last year
- A beamer template for LAMDA lab at NJU☆14Updated 4 years ago
- ☆89Updated 2 years ago
- ☆118Updated 4 months ago
- A variant of Varibad that is robust to difficult tasks☆9Updated last year
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆19Updated last year
- Meta RL codebase for Unstable Baselines☆20Updated last year
- [NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"☆116Updated 3 years ago
- Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'☆55Updated 2 years ago
- Pytorch solutions for UC Berkeley's cs285 assignments☆121Updated 2 years ago
- The implement of all kinds of dqn reinforcement learning with Pytorch☆89Updated 3 years ago
- ☆21Updated 6 years ago
- ☆28Updated 2 years ago