MyEncyclopedia / reinforcement-learning-2nd
☆22Updated 3 years ago
Related projects: ⓘ
- Alignment成为GPT类大模型微调的必须环节,深度强化学习是Alignment的核心。本项目是一个支持非gym环境训练、支持可视化配置的深度强化学习应用编程框架,30分钟上手强化学习编程。☆70Updated last year
- [动手学强化学习]系列,基于pytorch。☆51Updated 3 years ago
- 引用整理https://blog.csdn.net/yellow_red_people/article/details/80465510 一文中PyTorch平台,利用DQN模型玩Flappy Bird游戏,是一个再励学习(强化学习)实验例子。☆46Updated 5 years ago
- 《深度强化学习:原理与实践》,Code of the book <Deep Reinforcement Learning: Principles and Practices>☆148Updated 5 years ago
- 这个仓库用于存储一些强化学习练手小项目与算法实验。具体来讲,就是不至于单独成一个 repo 的项目,但是又值得拿出来讨论的代码。☆15Updated 3 years ago
- 国立台湾大学李宏毅老师讲解的深度强化学习学习笔记☆121Updated 4 years ago
- ☆20Updated 6 years ago
- 机器学习、深度学习、强化学习的读书笔记和代码☆18Updated 3 years ago
- 阿里云数智服务创新挑战赛——服务调度比赛☆17Updated 3 years ago
- Pytorch for Deep Reinforcement Learning☆235Updated 4 years ago
- A pack of reinforcement learning algorithms.☆80Updated 2 years ago
- 主要存储Datawhale组队学习中“强化学习”方向的资料。☆31Updated 3 years ago
- 《强化学习-原理与Python实现》的Pytorch实现。☆53Updated 3 years ago
- rl on super-mario-bros☆50Updated 3 years ago
- ☆45Updated 5 years ago
- A translation of Reinforcement Learning: An Introduction☆114Updated 6 years ago
- 强化学习-中文笔记&资源-以python实例为主-由浅入深☆83Updated 3 years ago
- 白话强化学习与PyTorch的学习笔记☆31Updated 4 years ago
- ☆13Updated 6 months ago
- Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clari…☆23Updated 3 months ago
- Tutorials of Tensorflow for beginners with popular data sets and projects. Let's have fun to learn Machine Learning with Tensorflow.☆112Updated 3 years ago
- Deep reinforcement learning agents implement by tensorflow https://ghli.org☆54Updated 5 years ago
- 《Reinforcement Learning: An Introduction》(第二版)中文翻译☆34Updated 5 years ago
- OpenAI团队的深度强化学习教程中文版☆71Updated last year
- ☆60Updated this week
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆53Updated 3 years ago
- Practice of Deep Reinforcement Learning with Keras and gym.☆157Updated 5 years ago
- Re-produce DQN, REINFORCE, REINFORCE with baseline, one-step AC, QAC, QAC with shared network, PPO2, DDPG, TD3, SAC, SAC discrete,A2C,A3C☆18Updated 4 years ago
- ☆159Updated 11 months ago
- 用 深度优先搜索 DFS 与 深度强化学习 DRL 分别自动控制 amazing brick 小游戏☆47Updated last month