wptoux / deep-tiaotiao
Using reinforcement learning to play the WeChat game Jump Jump (跳一跳)
☆18 Updated 7 years ago
Alternatives and similar repositories for deep-tiaotiao:
Users that are interested in deep-tiaotiao are comparing it to the libraries listed below
- Adapted from the article https://blog.csdn.net/yellow_red_people/article/details/80465510: plays Flappy Bird with a DQN model on PyTorch, as an experimental example of reinforcement learning. ☆47 Updated 5 years ago
- Deep reinforcement learning agents implemented in TensorFlow https://ghli.org ☆53 Updated 5 years ago
- Reinforcement Learning in Python ☆107 Updated 5 years ago
- ☆33 Updated 7 years ago
- Chinese translation of the book "Reinforcement Learning: An Introduction", Second Edition ☆123 Updated 5 years ago
- Implementation of PPO-clip and PPO-penalty on Atari, which claims to be the only open-source implementation of PPO-penalty ☆56 Updated 6 years ago
- A translation of Reinforcement Learning: An Introduction ☆114 Updated 6 years ago
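- A repository for learning the basic principles of reinforcement learning, mainly containing code for some of the examples and exercises from the book 《深入浅出强化学习原理入门》 ☆255 Updated 6 years ago
- Building the deep reinforcement learning model DQN with PyTorch ☆24 Updated 7 years ago
- Playing Tetris with reinforcement learning ☆16 Updated 6 years ago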
- ☆20 Updated 6 years ago
- Practice of Deep Reinforcement Learning with Keras and gym. ☆158 Updated 5 years ago
- Some reinforcement learning examples implemented in PyTorch ☆35 Updated 5 years ago
- A PyTorch-based Gomoku game model, using AlphaZero-style reinforcement learning and Monte Carlo Tree Search. ☆164 Updated 5 years ago
- Play Flappy Bird with DQN, a demo for reinforcement learning, implemented using PyTorch ☆67 Updated 7 years ago
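- PyTorch implementation of the book 《强化学习-原理与Python实现》 (Reinforcement Learning: Principles and Python Implementation). ☆54 Updated 4 years ago
- RL on super-mario-bros ☆50 Updated 4 years ago
- My reinforcement learning notes and study materials (still updating ...) ☆335 Updated 5 years ago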
- Collection of Deep Reinforcement Learning algorithms ☆124 Updated 7 years ago
- Reinforcement learning interviews (to be continued) ☆32 Updated 5 years ago
- Reinforcement learning DDPG code, following the DeepMind papers. ☆60 Updated 6 years ago
- ☆383 Updated 4 years ago
- Some notes and experience about David Silver's Reinforcement Learning Course ☆46 Updated 5 years ago
- Chinese version of the OpenAI team's deep reinforcement learning tutorial ☆75 Updated last year
- Reinforcement learning ☆38 Updated 6 years ago
- Homework for CS294 Fall 2017 ☆168 Updated 6 years ago
- Advantage actor-critic reinforcement learning for OpenAI Gym CartPole ☆64 Updated 7 years ago
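- A Dou Dizhu (斗地主) AI with a multi-channel stacked-attention Transformer architecture, built on an isolating-language hypothesis and breadth-first search ☆92 Updated 3 years ago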
- Implementation of the 33rd-place entry in the Kaggle competition Quora Pairs ☆24 Updated 7 years ago