nd009 / qlearning_robot
Solving a maze with the Q-learning algorithm
☆54Updated 7 years ago
Alternatives and similar repositories for qlearning_robot
Users that are interested in qlearning_robot are comparing it to the libraries listed below
- Pytorch for Deep Reinforcement Learning☆256Updated 5 years ago
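- My reinforcement learning notes and study materials, still updating ... ...☆363Updated 4 months ago
- A repository for learning the basic principles of reinforcement learning, mainly containing code for some of the examples and end-of-chapter exercises from the book 《深入浅出强化学习原理入门》 (An Accessible Introduction to the Principles of Reinforcement Learning)☆268Updated 7 years ago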
- ☆391Updated 5 years ago
- Practice of Deep Reinforcement Learning with Keras and gym.☆156Updated 6 years ago
- Multi-agent reinforcement learning☆107Updated 7 years ago
- ☆20Updated 7 years ago
- Reinforcement-Learning-Notes, start with MDP.☆225Updated 3 years ago
- Tutorial for Reinforcement Learning☆190Updated 4 years ago
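- The [动手学强化学习] (Hands-on Reinforcement Learning) series, based on PyTorch.☆59Updated 4 years ago
- Reinforcement learning☆66Updated 6 years ago
- Adapted from the article https://blog.csdn.net/yellow_red_people/article/details/80465510 : uses a DQN model on the PyTorch platform to play the Flappy Bird game, as an experimental example of reinforcement learning.☆50Updated 6 years ago
- This project provides a visually configurable reinforcement learning framework built around AgentRL, aimed at getting started with AgentRL programming in 30 minutes. Features related to AgentRL and local Agent, MCP, and A2A will be added later.☆79Updated 6 months ago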
- 《深度强化学习:原理与实践》,Code of the book <Deep Reinforcement Learning: Principles and Practices>☆200Updated 6 years ago
- Truly minimalist reinforcement learning (based on pfrl, a torch-based reinforcement learning framework)☆100Updated 3 years ago
- ☆29Updated 7 years ago
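- Study notes on the deep reinforcement learning lectures given by Professor Hung-yi Lee (李宏毅) of National Taiwan University☆150Updated 6 years ago
- Reinforcement learning: Chinese-language notes & resources, centered on Python examples, progressing from basic to advanced☆109Updated 5 years ago
- PyTorch implementation of 《强化学习-原理与Python实现》 (Reinforcement Learning: Principles and Python Implementation).☆61Updated 5 years ago
- Some reinforcement learning examples implemented in PyTorch☆36Updated 6 years ago
- Chinese edition of the OpenAI team's deep reinforcement learning tutorial☆91Updated 2 years ago
- 深度学习500问 (500 Questions on Deep Learning): explains, in question-and-answer form, frequently asked questions on probability, linear algebra, machine learning, deep learning, computer vision, and related topics, to help the author and any readers who need it. The book has 18 chapters and more than 500,000 characters. Given the author's limited expertise, readers are kindly asked to point out anything amiss in the book. To be continued............ If interested in collaborating, contact sc…☆12Updated 6 years ago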
- Reinforcement Learning Algorithm Package & PuckWorld, GridWorld Gym environments☆858Updated 6 years ago
- Study notes for 白话强化学习与PyTorch (Reinforcement Learning in Plain Language with PyTorch)☆37Updated 5 years ago
- ☆1,042Updated 2 years ago
- reinforcement learning ddpg code. follow deepmind papers.☆59Updated 7 years ago
- Learning Resources And Links Of Reinforcement Learning (updating)☆289Updated 4 years ago
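- Problem C of the China Post-Graduate Mathematical Contest in Modeling, addressing the flight recovery problem. The program first defines four classes (aircraft, flight, customer, airport) to simulate the flight scheduling environment, then applies a genetic algorithm to search for the optimal flight schedule☆33Updated 7 years ago
- shouyuantianxia / Algorithmic-Game-Theory-Application-on-Multi-agent-Combat-and-Verification-Platform-Design: Source-code appendix of the undergraduate thesis 《多智能体博弈兵棋推演理论与验证平台设计》 (Multi-agent Game-Theoretic Wargaming Theory and Verification Platform Design). The reinforcement learning implementation draws on Mr. Morvan Zhou's (周沫凡) open-source code at https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow ☆60Updated 5 years ago
- Chinese lecture notes for Stanford CS234 Reinforcement Learning☆208Updated 5 years ago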