wptoux / deep-tiaotiao
Playing the WeChat mini-game Tiao Yi Tiao (Jump Jump) with reinforcement learning
☆17Updated 6 years ago
Related projects
Alternatives and complementary repositories for deep-tiaotiao
- Chinese translation of the book Reinforcement Learning: An Introduction, Second Edition☆123Updated 5 years ago
- Implements PPO-clip and PPO-penalty on Atari; described as the only open-source implementation of PPO-penalty☆56Updated 5 years ago
- Reinforcement learning☆38Updated 6 years ago
- A translation of Reinforcement Learning: An Introduction☆114Updated 6 years ago
- Reinforcement Learning in Python☆107Updated 4 years ago
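- Code collected from the article https://blog.csdn.net/yellow_red_people/article/details/80465510: playing Flappy Bird with a DQN model in PyTorch, as an experimental example of reinforcement learning.☆47Updated 5 years ago
- Deep reinforcement learning agents implemented with TensorFlow https://ghli.org☆54Updated 5 years ago
- A repository for learning the basic principles of reinforcement learning, mainly containing code for some of the examples and exercises from the book 《深入浅出强化学习原理入门》☆254Updated 5 years ago
- A roundup of open-source competition solutions☆79Updated 5 years ago
- Building the deep reinforcement learning model DQN with PyTorch☆24Updated 6 years ago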
- ☆33Updated 6 years ago
- Play flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch☆67Updated 7 years ago
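- Study notes on "白话强化学习与PyTorch" (plain-language reinforcement learning with PyTorch)☆33Updated 4 years ago
- 33rd-place solution for the Kaggle competition Quora Pairs☆24Updated 7 years ago
- Chinese edition of the OpenAI team's deep reinforcement learning tutorial☆74Updated last year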
- ☆32Updated 4 years ago
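- A solution that applies reinforcement learning to automatically learn the optimal driving policy in complex traffic environments, reaching 100% accuracy in the test environment.☆20Updated 7 years ago
- Chinese notes for UCB CS294-112 Deep Reinforcement Learning☆49Updated 3 years ago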
- RL library based on algorithms from the book <A-introduction-to-reinforcement-learning>☆89Updated 6 years ago
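- Sparse Keras implementation of margin-softmax☆100Updated 6 years ago
- Reinforcement learning interview questions (to be continued)☆32Updated 4 years ago
- Applying DQN to the WeChat mini-program Tiao Yi Tiao (Jump Jump)☆14Updated 6 years ago
- Alignment has become an essential step in fine-tuning GPT-style large models, and deep reinforcement learning is at the core of alignment. This project is a deep reinforcement learning application programming framework that supports training in non-gym environments and visual configuration; get started with reinforcement learning programming in 30 minutes.☆71Updated last year
- A summary of interview questions and answers for machine learning and deep learning positions, drawn from the author's own and other people's interviews☆34Updated 5 years ago
- Some reinforcement learning examples implemented in PyTorch☆35Updated 5 years ago
- 12th-place solution for the 3rd 魔镜杯 (Mojing Cup) competition on question-similarity algorithm design for intelligent customer service☆149Updated 5 years ago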
- Practice of Deep Reinforcement Learning with Keras and gym.☆157Updated 5 years ago
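- A Dou Dizhu (斗地主) AI with a multi-channel stacked-attention Transformer architecture, built on an isolating-language hypothesis and breadth-first search☆90Updated 3 years ago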
- ☆32Updated 4 years ago