Anesck / M-D-R_learning_notes
Reading notes and code for machine learning, deep learning, and reinforcement learning
☆18 Updated 4 years ago
Alternatives and similar repositories for M-D-R_learning_notes:
Users that are interested in M-D-R_learning_notes are comparing it to the libraries listed below
- Study notes on the deep reinforcement learning course taught by Professor Hung-yi Lee (李宏毅) of National Taiwan University ☆142 Updated 5 years ago
- ☆36 Updated 5 years ago
- Mainly stores materials for the "reinforcement learning" track of the Datawhale team-study program. ☆33 Updated 4 years ago
- Evolutionary algorithms, alternative to Reinforcement Learning ☆38 Updated 2 years ago
- Code for the book "Deep Reinforcement Learning: Principles and Practices" (《深度强化学习:原理与实践》) ☆184 Updated 6 years ago
- ☆13 Updated last year
- Q-learning based optimal path algorithm is a Reinforcement Learning algorithm ☆13 Updated 2 years ago
- The "Hands-on Reinforcement Learning" (动手学强化学习) series, based on PyTorch. ☆54 Updated 3 years ago
- Truly minimalist reinforcement learning (using pfrl, a torch-based reinforcement learning framework) ☆78 Updated 3 years ago
- Reinforcement learning: Chinese-language notes and resources, built mainly around Python examples, from basic to advanced ☆102 Updated 4 years ago
- ☆12 Updated 2 years ago
- PyTorch implementation of the book 《强化学习-原理与Python实现》 ("Reinforcement Learning: Principles and Python Implementation"). ☆59 Updated 4 years ago
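- Alignment has become an essential step in fine-tuning GPT-style large models, and deep reinforcement learning is at the core of alignment. This project is a deep reinforcement learning application framework that supports training in non-gym environments and visual configuration; get started with reinforcement learning programming in 30 minutes. ☆73 Updated 2 years ago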
- RL Algorithms ☆13 Updated 2 years ago
- A multi-agent version of the Double DQN algorithm, with Foraging Task and Pursuit Game test scenarios ☆12 Updated 8 years ago
- PyTorch implementation of multiple deep reinforcement learning algorithms (DQN, DDPG, TD3, PPO, A3C...) with OpenAI Gym ☆57 Updated 3 years ago
- Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising ☆26 Updated 4 years ago
- Multi-agent learning library ☆18 Updated 3 years ago
- A naive version. ☆18 Updated 3 years ago
- ☆23 Updated 2 years ago
- ☆22 Updated 4 years ago
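- A repository for learning the basic principles of reinforcement learning, mainly containing code for some of the examples and end-of-chapter exercises from the book 《深入浅出强化学习原理入门》 ☆260 Updated 6 years ago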
- This is the official implementation of ERL-Re2. ☆64 Updated 10 months ago
- Notes from studying convex optimization ☆26 Updated 4 years ago
- Multi-objective reinforcement learning deals with finding policies for tasks where there are multiple distinct criteria to optimize for. … ☆21 Updated 6 years ago
- ☆43 Updated last week
- Deep Q Network for Multi-agent RL ☆15 Updated 4 years ago
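- This repository stores small reinforcement learning practice projects and algorithm experiments: specifically, code that does not warrant a repo of its own but is still worth discussing. ☆20 Updated 3 years ago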
- Building a DQN deep reinforcement learning model with PyTorch ☆24 Updated 7 years ago
- My reinforcement learning notes and study materials, still updating ... ☆346 Updated 5 years ago