MyEncyclopedia / reinforcement-learning-2nd
☆22Updated 4 years ago
Alternatives and similar repositories for reinforcement-learning-2nd:
Users that are interested in reinforcement-learning-2nd are comparing it to the libraries listed below
- Alignment成为GPT类大模型微调的必须环节,深度强化学习是Alignment的核心。本项目是一个支持非gym环境训练、支持可视化配置的深度强化学习应用编程框架,30分钟上手强化学习编程。☆72Updated 2 years ago
- 《深度强化学习:原理与实践》,Code of the book <Deep Reinforcement Learning: Principles and Practices>☆173Updated 5 years ago
- [动手学强化学习]系列,基于pytorch。☆54Updated 3 years ago
- 引用整理https://blog.csdn.net/yellow_red_people/article/details/80465510 一文中PyTorch平台,利用DQN模型玩Flappy Bird游戏,是一个再励学习(强化学习)实验例子。☆48Updated 6 years ago
- A translation of Reinforcement Learning: An Introduction☆115Updated 6 years ago
- 国立台湾大学李宏毅老师讲解的深度强化学习学习笔记☆140Updated 5 years ago
- 主要存储Datawhale组队学习中“强化学习”方向的资料。☆32Updated 4 years ago
- ☆20Updated 7 years ago
- 强化学习-中文笔记&资源-以python实例为主-由浅入深☆96Updated 4 years ago
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆57Updated 3 years ago
- OpenAI团队的深度强化学习教程中文版☆76Updated last year
- UCB CS294-112 深度强化学习 中文笔记☆51Updated 4 years ago
- A pack of reinforcement learning algorithms.☆83Updated 3 years ago
- 《强化学习-原理与Python实现》的Pytorch实现。☆57Updated 4 years ago
- rl on super-mario-bros☆53Updated 4 years ago
- 这是一个学习强化学习基础原理的仓库,主要包括了《深入浅出强化学习原理入门》书中一些例子和课后作业的代码☆256Updated 6 years ago
- Tutorials of Tensorflow for beginners with popular data sets and projects. Let's have fun to learn Machine Learning with Tensorflow.☆115Updated 4 years ago
- 《Reinforcement Learning: An Introduction》(第二版)中文翻译☆42Updated 5 years ago
- A* (A-Star) algorithm for finding the shortest path in a maze☆15Updated 4 years ago
- 用强化学习来玩微信跳一跳☆19Updated 7 years ago
- DQN examples codes in chapter 4☆43Updated 2 years ago
- 多智能体即时策略对抗方法与实践 苏炯铭 刘鸿福 陈少飞 项凤涛 编著 科学出版社 2019.11 随书代码☆32Updated 4 years ago
- A explaintable and modified version of udacity DRL homework☆26Updated 4 years ago
- ☆162Updated last year
- Re-produce DQN, REINFORCE, REINFORCE with baseline, one-step AC, QAC, QAC with shared network, PPO2, DDPG, TD3, SAC, SAC discrete,A2C,A3C☆19Updated 4 years ago
- RL Algorithms☆13Updated 2 years ago
- 白话强化学习与PyTorch的学习笔记☆33Updated 4 years ago
- ☆390Updated 4 years ago
- lecture32_AI挑战星际争霸II(强化学习)☆17Updated 2 years ago
- reinforcement learning☆46Updated 4 years ago