Q-learning and SARSA algorithms from Sutton's Reinforcement Learning book.
☆21May 19, 2019Updated 6 years ago
Alternatives and similar repositories for Cliff-Walking-Solution
Users that are interested in Cliff-Walking-Solution are comparing it to the libraries listed below
Sorting:
- ☆10Dec 31, 2020Updated 5 years ago
- Code for CLVision workshop (CVPR 2024) paper - Calibrating Higher-Order Statistics for Few-Shot Class-Incremental Learning with Pre-train…☆11Nov 12, 2024Updated last year
- ☆12Aug 15, 2020Updated 5 years ago
- Official Code for PACDA (CVPR Workshop - 4th CLVision)☆12Jun 13, 2023Updated 2 years ago
- Code repo for the NeurIPS 2021 paper "Online Adaption to Label Distribution Shift".☆15Feb 15, 2023Updated 3 years ago
- ☆11Aug 16, 2018Updated 7 years ago
- ☆13Jan 5, 2021Updated 5 years ago
- ☆17Mar 21, 2021Updated 4 years ago
- A tutorial to learn RL from examples. This is my notes from a course of Baidu PaddlePaddle. (世界冠军带你从零实践强化学习)☆12Jul 26, 2023Updated 2 years ago
- ☆14Feb 18, 2023Updated 3 years ago
- Cliff walking reinforcement learning example, with a variety of RL algorithms☆15Dec 5, 2023Updated 2 years ago
- notes for NJU courses☆18Oct 26, 2021Updated 4 years ago
- POLAR official tool☆20Feb 13, 2026Updated 3 weeks ago
- Regularized Learning under label shifts☆18May 1, 2019Updated 6 years ago
- PyTorch Implementation of Reptile☆22Jan 5, 2021Updated 5 years ago
- my big project PA for the course Introduction To Computer System in Nanjing University, building an operating system based on qemu called…☆13Dec 17, 2019Updated 6 years ago
- A PyTorch framework for Continual Learning research.☆23Dec 7, 2023Updated 2 years ago
- ☆18Jul 10, 2022Updated 3 years ago
- Code for "Self-Sustaining Representation Expansion for Non-Exemplar Class-Incremental Learning"☆22Oct 26, 2022Updated 3 years ago
- ☆21Mar 25, 2023Updated 2 years ago
- 南京大学计算机科学与技术系 2020操作系统课程实验☆20Jul 1, 2025Updated 8 months ago
- ☆24Sep 24, 2021Updated 4 years ago
- Neuro-Inspired Stability-Plasticity Adaptation for Sparse Continual Learning☆22Mar 4, 2023Updated 3 years ago
- The official code release for Don't Trust: Verify -- Grounding LLM Quantitative Reasoning with Autoformalization☆34Mar 9, 2025Updated last year
- ☆29Dec 26, 2017Updated 8 years ago
- The latest source code of the tool Flow*☆28Jan 15, 2023Updated 3 years ago
- OrCo: Towards Better Generalization via Orthogonality and Contrast for Few-Shot Class-Incremental Learning☆33Oct 13, 2025Updated 4 months ago
- Solution for Zillow’s Home Value Prediction Competition on Kaggle☆24Jan 6, 2019Updated 7 years ago
- Behavioural cloning experiments with video games☆32Apr 15, 2020Updated 5 years ago
- ☆26Mar 12, 2015Updated 10 years ago
- PyTorch implementation of the implicit Q-learning algorithm (IQL)☆44Dec 17, 2021Updated 4 years ago
- Noise Contrastive Estimation (NCE) in PyTorch☆32Mar 2, 2025Updated last year
- Unofficial Re-implementation of "Dream to Control: Learning Behaviors by Latent Imagination" (https://arxiv.org/abs/1912.01603 ) with PyT…☆32Aug 22, 2020Updated 5 years ago
- ☆35Jun 9, 2025Updated 9 months ago
- SARSA, Q-Learning, Expected SARSA, SARSA(λ) and Double Q-learning Implementation and Analysis☆30Aug 19, 2019Updated 6 years ago
- ☆33May 19, 2023Updated 2 years ago
- 《机器学习》-周志华(西瓜书)是一本了解机器学习知识比较全面的一本书。今年求职算法岗位时针对部分章节内容进行了整理和总结,并对其中的数学公式进行了手工推导。在此记录下自己的学习笔记。☆32Jan 26, 2021Updated 5 years ago
- Official implementation of CVPR 2023 paper Few-Shot Class-Incremental Learning via Class-Aware Bilateral Distillation.☆39May 25, 2023Updated 2 years ago
- Operator System Lab for Nanjing University CS Department☆26Jun 6, 2022Updated 3 years ago