xushsh163 / A3CSuperMario_Windows
SuperMario A3C Trainer for windows
☆31Updated 5 years ago
Related projects: ⓘ
- 引用整理https://blog.csdn.net/yellow_red_people/article/details/80465510 一文中PyTorch平台,利用DQN模型玩Flappy Bird游戏,是一个再励学习(强化学习)实验例子。☆46Updated 5 years ago
- C++/python fight the lord with pybind11 (强化学习AI斗地主), Accepted to AIIDE-2020☆158Updated 3 years ago
- rl on super-mario-bros☆50Updated 3 years ago
- Play flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch☆68Updated 7 years ago
- 游戏AI探索者☆16Updated 6 years ago
- Collection of Reinforcement Learning / Meta Reinforcement Learning Environments.☆275Updated 2 months ago
- A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.☆160Updated 5 years ago
- 用 深度优先搜索 DFS 与 深度强化学习 DRL 分别自动控制 amazing brick 小游戏☆47Updated last month
- A student implementation of Alpha Go Zero☆276Updated 6 years ago
- Stable Baselines官方文档中文版☆93Updated 3 years ago
- 基于DQN的五子棋人机对弈☆54Updated 5 years ago
- Practice of Deep Reinforcement Learning with Keras and gym.☆157Updated 5 years ago
- [NeurIPS 2022] PerfectDou: Dominating DouDizhu with Perfect Information Distillation☆151Updated 4 months ago
- Pytorch for Deep Reinforcement Learning☆235Updated 4 years ago
- 这是一个学习强化学习基础原理的仓库,主要包括了《深入浅出强化学习原理入门》书中一些例子和课后作业的代码☆252Updated 5 years ago
- 通过深度强化学习训练的AI玩起游戏来也是有板有眼,将人类玩家远远甩在身后。本文就将为您介绍如何训练AI玩微信飞机大战,biubiubiu~☆23Updated 5 years ago
- Playing Flappy Bird Using Deep Reinforcement Learning (Based on Deep Q Learning DQN using Tensorflow)☆576Updated 3 years ago
- 强化学习训练斗地主 / doudizhu AI using reinforcement learning.☆13Updated 5 years ago
- 《Reinforcement Learning: An Introduction》(第二版)中文翻译☆428Updated 2 years ago
- [动手学强化学习]系列,基于pytorch。☆51Updated 3 years ago
- 用强化学习玩俄罗斯方块☆13Updated 6 years ago
- implement the classic reinforcement learning algorithm DQN to play supermariobrother☆15Updated 6 years ago
- ☆384Updated 4 years ago
- ☆45Updated 5 years ago
- Deep RL algorithm in pytorch☆284Updated last year
- Alignment成为GPT类大模型微调的必须环节,深度强化学习是Alignment的核心。本项目是一个支持非gym环境训练、支持可视化配置的深度强化学习应用编程框架,30分钟上手强化学习编程。☆70Updated last year
- Resources of 3D Wizard Projects☆62Updated 3 years ago
- 用强化学习来玩微信跳一跳☆17Updated 6 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆185Updated 4 years ago
- 强化学习玩flappy bird☆21Updated 3 years ago