panjd123 / D3QN-Snake
A Snake game AI implemented with the D3QN reinforcement learning algorithm in the PyTorch framework.
☆14 Updated 2 years ago
Alternatives and similar repositories for D3QN-Snake:
Users who are interested in D3QN-Snake are comparing it to the repositories listed below
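- Training a Gomoku AI with reinforcement learning (self-play + MCTS), based on PyTorch ☆23 Updated 3 years ago
- A Reversi (Othello) AI based on the Monte Carlo tree search algorithm ☆13 Updated 2 years ago
- A curling game example based on reinforcement learning (RL); gradient-descent Sarsa(lambda) + non-uniform radial basis feature representation ☆18 Updated 4 years ago
- Using deep reinforcement learning to play the Snake game. ☆79 Updated 3 years ago
- Personal experience from the 2020 MCM/ICM, with contest preparation materials and a Problem C paper that won the Meritorious (M) award ☆81 Updated 4 years ago
- Python implementations of the example code from "Mathematical Modeling Algorithms and Applications" (Si Shoukui, National Defense Industry Press) ☆44 Updated 4 years ago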
- ☆108 Updated 3 months ago
- Introduction to AI course assignment at Berkeley in spring 2019 ☆74 Updated 5 years ago
- [ICML 2024] Exploration and Anti-exploration with Distributional Random Network Distillation ☆10 Updated 3 months ago
- Half Field Offense in Robocup 2D Soccer with reinforcement learning ☆33 Updated 3 years ago
- Use seaborn to draw RL picture ☆25 Updated last year
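- Solving maze problems with reinforcement learning: Q-learning and supervised learning ☆21 Updated 4 years ago
- Wuhan University of Technology course-grabbing / one-click teaching evaluation tool, PyQt ☆12 Updated 6 years ago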
- Reinforcement learning with PyTorch, inspired by MorvanZhou; changes the framework from TensorFlow to PyTorch ☆262 Updated 4 years ago
- Reinforcement learning ☆27 Updated last month
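- A deep reinforcement learning Snake game with a complete game environment and AI interface. (Project unfinished) ☆37 Updated 5 years ago
- The 20th "Huawei Cup" China Post-Graduate Mathematical Contest in Modeling ☆28 Updated last year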
- ☆68 Updated 3 months ago
- Implement the PPO algorithm on MuJoCo environments, such as Ant-v2, Humanoid-v2, Hopper-v2, HalfCheetah-v2. ☆49 Updated 4 years ago
- rl-papers ☆47 Updated last year
- (ICML 2024) The official code for EvoRainbow: Combining Improvements in Evolutionary Reinforcement Learning for Policy Search ☆21 Updated 7 months ago
- Implementation of MADDPG using PettingZoo and PyTorch ☆122 Updated last year
- NeurIPS 2022: Constrained Update Projection Approach to Safe Policy Optimization ☆12 Updated last year
- DQN PyTorch ☆14 Updated 3 years ago
- Playing Super Mario with reinforcement learning ☆60 Updated 2 years ago
- The author solved the basic problem model of job shop scheduling using the simulated annealing algorithm. ☆9 Updated 8 months ago
- ☆100 Updated last month
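- 2020 China national mathematical modeling contest, Problem B "Desert Game": paper overview, source code, and LaTeX paper from a Tongji University economics and management team; national first prize ☆56 Updated 4 years ago
- Code that scored 3.19 million points in the preliminary round of the 2023 Huawei Software Elite Challenge, ranking first in Guangxi and 8th in the Guangdong-Hong Kong-Macao region. The contest requires participants to control 4 robots on a 50m*50m map for task scheduling, designing the robots' motion, path-planning, and task-scheduling algorithms to buy or sell goods at the various types of workbenches distributed across the map and earn the price difference, to… ☆16 Updated last year
- UAVGym is a Gym-style UAV simulation environment written in Python, for research on reinforcement learning algorithms. ☆37 Updated last year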