zhaoyingjun / general
Alignment成为GPT类大模型微调的必须环节,深度强化学习是Alignment的核心。本项目是一个支持非gym环境训练、支持可视化配置的深度强化学习应用编程框架,30分钟上手强化学习编程。
☆72Updated last year
Alternatives and similar repositories for general:
Users that are interested in general are comparing it to the libraries listed below
- 一些利用pytorch编程实现的强化学习例子☆36Updated 5 years ago
- [动手学强化学习]系列,基于pytorch。☆54Updated 3 years ago
- Pytorch for Deep Reinforcement Learning☆243Updated 4 years ago
- 多智能体强化学习☆87Updated 6 years ago
- Tutorial for Reinforcement Learning☆178Updated 3 years ago
- My DRL library with tensorflow1.14 based on openai spinning-up☆61Updated 3 years ago
- 多智能体强化学习(MARL)算法复现,包括QMIX,VDN,QTRAN、MAVEN等等☆186Updated 2 years ago
- 强化学习-中文笔记&资源-以python实例为主-由浅入深☆92Updated 4 years ago
- 引用整理https://blog.csdn.net/yellow_red_people/article/details/80465510 一文中PyTorch平台,利用DQN模型玩Flappy Bird游戏,是一个再励学习(强化学习)实验例子。☆48Updated 6 years ago
- ☆24Updated 4 years ago
- 这是一个学习强化学习基础原理的仓库,主要包括了《深入浅出强化学习原理入门》书中一些例子和课后作业的代码☆256Updated 6 years ago
- 《深度强化学习:原理与实践》,Code of the book <Deep Reinforcement Learning: Principles and Practices>☆167Updated 5 years ago
- 深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系sc…☆12Updated 5 years ago
- 天授中文文档☆55Updated 2 months ago
- Re-produce DQN, REINFORCE, REINFORCE with baseline, one-step AC, QAC, QAC with shared network, PPO2, DDPG, TD3, SAC, SAC discrete,A2C,A3C☆19Updated 4 years ago
- 白话强化学习与PyTorch的学习笔记☆33Updated 4 years ago
- ☆20Updated 6 years ago
- 使用pytorch构建深度强化学习模型DQN☆24Updated 7 years ago
- Practice of Deep Reinforcement Learning with Keras and gym.☆158Updated 5 years ago
- A collection of multi agent environments based on OpenAI gym.☆22Updated last year
- 真-极简强化学习(基于torch的强化学习框架pfrl)☆72Updated 2 years ago
- 国立台湾大学李宏毅老师讲解的深度强化学习学习笔记☆139Updated 5 years ago
- qmix☆22Updated 4 years ago
- Code for the paper “Control Strategy of Speed Servo Systems Based on Deep Reinforcement Learning”☆24Updated last year
- The implement of all kinds of dqn reinforcement learning with Pytorch☆93Updated 3 years ago
- Reinforcement Learning Algorithms Based on PyTorch☆18Updated 2 years ago
- ☆45Updated 5 years ago
- RL algorithms☆140Updated 3 years ago
- 强化学习中纳什Qlearning 实现矩阵博弈☆29Updated 5 years ago
- 《强化学习-原理与Python实现》的Pytorch实现。☆56Updated 4 years ago