Anesck / M-D-R_learning_notesLinks
机器学习、深度学习、强化学习的读书笔记和代码
☆18Updated 4 years ago
Alternatives and similar repositories for M-D-R_learning_notes
Users that are interested in M-D-R_learning_notes are comparing it to the libraries listed below
Sorting:
- Alignment成为GPT类大模型微调的必须环节,深度强化学习是Alignment的核心。本项目是一个支持非gym环境训练、支持可视化配置的深度强化学习应用编程框架,30分钟上手强化学习编程。☆73Updated 2 years ago
- 《深度强化学习:原理与实践》,Code of the book <Deep Reinforcement Learning: Principles and Practices>☆186Updated 6 years ago
- 使用pytorch构建深度强化学习模型DQN☆25Updated 7 years ago
- ☆36Updated 5 years ago
- 记录学习凸优化的笔记☆27Updated 4 years ago
- 真-极简强化学习(基于torch的强化学习框架pfrl)☆79Updated 3 years ago
- Dynamic channel allocation in cellular networks by reinforcement learning☆17Updated 3 years ago
- ☆13Updated last year
- 国立台湾大学李宏毅老师讲解的深度强化学习学习笔记☆143Updated 5 years ago
- 主要存储Datawhale组队学习中“强化学习”方向的资料。☆33Updated 4 years ago
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆57Updated 3 years ago
- ☆43Updated last month
- reinforcement learning☆38Updated 7 years ago
- 对抗强化学习文本分类☆9Updated 5 years ago
- A pack of reinforcement learning algorithms.☆84Updated 3 years ago
- Source codes for the book "Application of Neural Network and PyTorch"☆155Updated 2 years ago
- 最优化方法、凸优化课程作业代码☆13Updated 5 years ago
- Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising☆26Updated 4 years ago
- Multi-objective reinforcement learning for covid-19 control☆11Updated 3 years ago
- Evolutionary algorithms, alternative to Reinforcement Learning☆38Updated 2 years ago
- Deep Q Network for Multi-agent RL☆15Updated 4 years ago
- Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation☆49Updated 5 years ago
- This is yangxx's repo for machine learning☆36Updated 3 years ago
- [动手学强化学习]系列,基于pytorch。☆55Updated 4 years ago
- 多智能体强化学习☆98Updated 6 years ago
- 小样本分类实践☆12Updated 4 years ago
- Image classification using reinforcement learning and multi-agent system☆49Updated 11 months ago
- RL algorithms☆141Updated 4 years ago
- ☆163Updated 5 years ago
- Q-learning based optimal path algorithm is a Reinforcement Learning algorithm☆13Updated 2 years ago