xushsh163 / A3CSuperMario_Windows
Super Mario A3C trainer for Windows
☆32Updated 5 years ago
Related projects
Alternatives and complementary repositories for A3CSuperMario_Windows
- rl on super-mario-bros☆50Updated 3 years ago
- Game AI Explorer☆16Updated 6 years ago
- Compiled from the article https://blog.csdn.net/yellow_red_people/article/details/80465510: playing Flappy Bird with a DQN model on PyTorch, as an example reinforcement learning experiment.☆47Updated 5 years ago
- A PyTorch-based Gomoku game model, using AlphaZero-style reinforcement learning and Monte Carlo Tree Search.☆161Updated 5 years ago
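- Chinese edition of the official Stable Baselines documentation☆93Updated 3 years ago
- Alignment has become a required step in fine-tuning GPT-style large models, and deep reinforcement learning is at the core of alignment. This project is a deep reinforcement learning application programming framework that supports training in non-gym environments and visual configuration; get started with reinforcement learning programming in 30 minutes.☆71Updated last year
- Using reinforcement learning to play WeChat's Tiao Yi Tiao (Jump Jump)☆17Updated 6 years ago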
- A student implementation of Alpha Go Zero☆279Updated 6 years ago
- Collection of Reinforcement Learning / Meta Reinforcement Learning Environments.☆277Updated 4 months ago
- Resources of 3D Wizard Projects☆62Updated 3 years ago
- Practice of Deep Reinforcement Learning with Keras and gym.☆157Updated 5 years ago
- The [Hands-on Reinforcement Learning] series, based on PyTorch.☆54Updated 3 years ago
- Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advanta…☆173Updated 2 months ago
- ☆33Updated 6 years ago
- ☆24Updated 3 years ago
- DQN-based human-vs-computer Gomoku☆55Updated 5 years ago
- C++/Python Fight the Landlord with pybind11 (a reinforcement learning AI for DouDizhu), accepted to AIIDE-2020☆157Updated 3 years ago
- [NeurIPS 2022] PerfectDou: Dominating DouDizhu with Perfect Information Distillation☆161Updated 6 months ago
- Chinese-language StarCraft II AI tutorial: StarCraft2 AI with python-sc2/pysc2 API☆228Updated 3 years ago
- Tianshou Chinese documentation☆55Updated 2 years ago
- Playing Flappy Bird Using Deep Reinforcement Learning (Based on Deep Q Learning DQN using Tensorflow)☆577Updated 4 years ago
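- A repository for learning the basic principles of reinforcement learning, mainly containing code for some of the examples and end-of-chapter exercises from the book 《深入浅出强化学习原理入门》☆254Updated 5 years ago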
- Gym - 32 levels of original Super Mario Bros☆286Updated 5 years ago
- Reinforcement learning☆62Updated 5 years ago
- Reinforcing Your Learning of Reinforcement Learning☆88Updated 5 years ago
- Implementation of PPO-clip and PPO-penalty on Atari, described as the only open-source implementation of PPO-penalty☆56Updated 5 years ago
- OpenAI Gym Env for game Gomoku(Five-In-a-Row, 五子棋, 五目並べ, omok, Gobang,...)☆85Updated last month
- Code for the book 《深度强化学习:原理与实践》 (Deep Reinforcement Learning: Principles and Practices)☆152Updated 5 years ago
- ☆59Updated 5 years ago
- ☆158Updated last year