Anesck / M-D-R_learning_notes
Reading notes and code for machine learning, deep learning, and reinforcement learning
☆18 Updated 4 years ago
Alternatives and similar repositories for M-D-R_learning_notes:
Users who are interested in M-D-R_learning_notes are comparing it to the repositories listed below
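- Mainly stores materials for the "reinforcement learning" track of Datawhale team learning. ☆32 Updated 4 years ago
- Study notes on deep reinforcement learning as taught by Professor Hung-yi Lee (李宏毅) of National Taiwan University ☆132 Updated 5 years ago
- Code for the book "Deep Reinforcement Learning: Principles and Practices" (《深度强化学习:原理与实践》) ☆158 Updated 5 years ago
- Multi-agent learning library ☆16 Updated 3 years ago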
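- shouyuantianxia / Algorithmic-Game-Theory-Application-on-Multi-agent-Combat-and-Verification-Platform-Design: Source code appendix for the undergraduate thesis "Theory of Multi-Agent Game-Based Wargaming and Verification Platform Design" (《多智能体博弈兵棋推演理论与验证平台设计》). The reinforcement learning algorithm implementations draw on Morvan Zhou's open-source code at https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow ☆53 Updated 4 years ago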
- ☆22 Updated 4 years ago
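- Alignment has become a required step in fine-tuning GPT-style large models, and deep reinforcement learning is at the core of alignment. This project is a deep reinforcement learning application programming framework that supports training in non-gym environments and visual configuration; get started with reinforcement learning programming in 30 minutes. ☆72 Updated last year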
- RL Algorithms ☆13 Updated last year
- ☆36 Updated 5 years ago
- ☆13 Updated 10 months ago
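- Multi-agent reinforcement learning ☆85 Updated 6 years ago
- The [Hands-on Reinforcement Learning] series, based on PyTorch. ☆54 Updated 3 years ago
- Reproductions of multi-agent reinforcement learning (MARL) algorithms, including QMIX, VDN, QTRAN, MAVEN, and others ☆184 Updated 2 years ago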
- A pack of reinforcement learning algorithms. ☆82 Updated 3 years ago
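- PyTorch implementation of the book "Reinforcement Learning: Principles and Python Implementation" (《强化学习-原理与Python实现》). ☆54 Updated 4 years ago
- PyTorch implementations of multiple deep reinforcement learning algorithms (DQN, DDPG, TD3, PPO, A3C...) with OpenAI Gym ☆56 Updated 3 years ago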
- PyTorch implementation of PPO algorithm ☆21 Updated 5 years ago
- RL code for beginners. Enjoy! ☆110 Updated 4 years ago
- Source code for the book "Application of Neural Network and PyTorch" ☆155 Updated 2 years ago
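- A repository for learning the basic principles of reinforcement learning, mainly containing code for some of the examples and end-of-chapter exercises from the book 《深入浅出强化学习原理入门》 ☆255 Updated 6 years ago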
- Evolutionary algorithms, alternative to Reinforcement Learning ☆37 Updated last year
- This is the official implementation of ERL-Re2. ☆60 Updated 7 months ago
- Simple reinforcement learning tutorials ☆14 Updated 5 years ago
- Notes from studying convex optimization ☆26 Updated 4 years ago
- ☆12 Updated 2 years ago
- ☆23 Updated 2 years ago
- A Q-learning-based optimal path algorithm (a reinforcement learning algorithm) ☆13 Updated last year
- Datawhale operations research group ☆17 Updated 2 years ago
- Reimplements multi-agent deep deterministic policy gradient (MADDPG) in PyTorch, modifying https://github.com/xuemei-ye/maddpg-mpe so that it runs on the author's own computer. Because the author's laptop has no CUDA, the experiment speed… ☆13 Updated 5 years ago