MyEncyclopedia / reinforcement-learning-2nd
☆22Updated 4 years ago
Alternatives and similar repositories for reinforcement-learning-2nd:
Users that are interested in reinforcement-learning-2nd are comparing it to the libraries listed below
- 引用整理https://blog.csdn.net/yellow_red_people/article/details/80465510 一文中PyTorch平台,利用DQN模型玩Flappy Bird游戏,是一个再励学习(强化学习)实验例子。☆47Updated 6 years ago
- Alignment成为GPT类大模型微调的必须环节,深度强化学习是Alignment的核心。本项目是一个支持非gym环境训练、支持可视化配置的深度强化学习应用编程框架,30分钟上手强化学习编程。☆73Updated 2 years ago
- 《强化学习-原理与Python实现》的Pytorch实现。☆59Updated 4 years ago
- [动手学强化学习]系列,基于pytorch。☆54Updated 3 years ago
- 白话强化学习与PyTorch的学习笔记☆35Updated 5 years ago
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆57Updated 3 years ago
- OpenAI团队的深度强化学习教程中文版☆78Updated last year
- 《深度强化学习:原理与实践》,Code of the book <Deep Reinforcement Learning: Principles and Practices>☆184Updated 6 years ago
- ☆20Updated 7 years ago
- ☆45Updated 5 years ago
- 这个仓库用于存储一些强化学习练手小项目与算法实验。具体来讲,就是不至于单独成一个 repo 的项目,但是又值得拿出来讨论的代码。☆20Updated 3 years ago
- 强化学习-中文笔记&资源-以python实例为主-由浅入深☆102Updated 4 years ago
- 用强化学习来玩微信跳一跳☆19Updated 7 years ago
- My DRL library with tensorflow1.14 based on openai spinning-up☆61Updated 4 years ago
- A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.☆166Updated 6 years ago
- 这是一个学习强化学习基础原理的仓库,主要包括了《深入浅出强化学习原理入门》书中一些例子和课后作业的代码☆260Updated 6 years ago
- A pack of reinforcement learning algorithms.☆84Updated 3 years ago
- 机器学习、深度学习、强化学习的读书笔记和代码☆18Updated 4 years ago
- Pytorch for Deep Reinforcement Learning☆247Updated 4 years ago
- 国立台湾大学李宏毅老师讲解的深度强化学习学习笔记☆142Updated 5 years ago
- 一些利用pytorch编程实现的强化学习例子☆36Updated 6 years ago
- ☆13Updated last year
- RL Algorithms☆13Updated 2 years ago
- Deep reinforcement learning agents implement by tensorflow https://ghli.org☆53Updated 5 years ago
- ☆389Updated 4 years ago
- A* (A-Star) algorithm for finding the shortest path in a maze☆15Updated 4 years ago
- 用 qlearning 算法走迷宫☆51Updated 7 years ago
- ☆17Updated 2 years ago
- reinforcement learning☆46Updated 4 years ago
- ☆76Updated 3 years ago