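zhaoyingjun / general
Alignment has become an essential step in fine-tuning GPT-style large models, and deep reinforcement learning is at the core of alignment. This project is a deep reinforcement learning application programming framework that supports training in non-gym environments and visual configuration; get started with reinforcement learning programming in 30 minutes.
☆73 Updated 2 years ago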
Alternatives and similar repositories for general
Users that are interested in general are comparing it to the libraries listed below
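- Study notes for the book 白话强化学习与PyTorch (Plain-Language Reinforcement Learning with PyTorch) ☆35 Updated 5 years ago
- Pytorch for Deep Reinforcement Learning ☆248 Updated 4 years ago
- Adapted from the PyTorch code in the article https://blog.csdn.net/yellow_red_people/article/details/80465510 ; uses a DQN model to play the Flappy Bird game, as an experimental example of reinforcement learning. ☆47 Updated 6 years ago
- The [动手学强化学习] (Hands-on Reinforcement Learning) series, based on PyTorch. ☆55 Updated 4 years ago
- My DRL library built with TensorFlow 1.14, based on OpenAI Spinning Up ☆61 Updated 4 years ago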
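- Tutorial for Reinforcement Learning ☆184 Updated 3 years ago
- Building a DQN deep reinforcement learning model with PyTorch ☆25 Updated 7 years ago
- Some reinforcement learning examples implemented in PyTorch ☆36 Updated 6 years ago
- Multi-agent reinforcement learning ☆98 Updated 6 years ago
- A repository for learning the basic principles of reinforcement learning, mainly containing code for some of the examples and exercises from the book 《深入浅出强化学习原理入门》 ☆262 Updated 6 years ago
- Reinforcement learning: Chinese-language notes and resources, mainly Python examples, from basic to advanced ☆103 Updated 4 years ago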
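- Practice of Deep Reinforcement Learning with Keras and gym. ☆158 Updated 6 years ago
- Study notes on the deep reinforcement learning course taught by Professor Hung-yi Lee (李宏毅) of National Taiwan University ☆143 Updated 5 years ago
- ☆20 Updated 7 years ago
- An implementation of Nash Q-learning for matrix games in reinforcement learning ☆30 Updated 6 years ago
- Solving a maze with the Q-learning algorithm ☆51 Updated 7 years ago
- Tianshou (天授) Chinese documentation ☆58 Updated 5 months ago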
- RL algorithms ☆141 Updated 4 years ago
- Reproductions of multi-agent reinforcement learning (MARL) algorithms, including QMIX, VDN, QTRAN, MAVEN, and others ☆198 Updated 2 years ago
- Code for the book 《深度强化学习:原理与实践》 (Deep Reinforcement Learning: Principles and Practices) ☆186 Updated 6 years ago
- Reinforcement learning algorithms implemented for Tensorflow 2.0+ [DQN, DDPG, AE-DDPG, SAC, PPO, Primal-Dual DDPG] ☆310 Updated 2 years ago
- ☆29 Updated 6 years ago
- ☆166 Updated last year
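- shouyuantianxia / Algorithmic-Game-Theory-Application-on-Multi-agent-Combat-and-Verification-Platform-Design: Source code appendix for the undergraduate thesis 《多智能体博弈兵棋推演理论与验证平台设计》 (Multi-Agent Game-Theoretic Wargaming: Theory and Verification Platform Design). The reinforcement learning implementation draws on Morvan Zhou's open-source code at https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow ☆55 Updated 4 years ago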
- meta-MADDPG (Python implementation) ☆18 Updated 6 years ago
- Collection of Reinforcement Learning / Meta Reinforcement Learning Environments. ☆291 Updated 10 months ago
- ☆24 Updated 4 years ago
- A Toolbox for deep reinforcement learning (QLearning) ☆38 Updated 6 years ago
- qmix ☆22 Updated 5 years ago
- D3QN implementation using pytorch ☆15 Updated 4 years ago