Sharpiless / PARL-DQN-daxiguaLinks
用parl框架的DQN强化学习算法玩“合成大西瓜”
☆14Updated 4 years ago
Alternatives and similar repositories for PARL-DQN-daxigua
Users that are interested in PARL-DQN-daxigua are comparing it to the libraries listed below
Sorting:
- 深度学习入门 | 三岁在飞桨带你入门深度学习—Carpoel,利用PARL复现基于神经网络与DQN算法(真的是0基础)☆11Updated 3 years ago
- 强化学习教程☆22Updated 4 years ago
- Optimization of Yagi Antenna Based on Reinforcement Learning Framework Parl☆18Updated last year
- 🎾 Multi-Agent Proximal Policy Optimization approach to a competitive reinforcement learning problem☆22Updated 2 years ago
- 用强化学习DQN算法,训练AI模型来玩合成大西瓜游戏,提供Keras版本和PARL(paddle)版本☆90Updated 4 years ago
- ☆14Updated 4 years ago
- pacman with paddlepaddle gesture control,手势识别用于吃豆人小游戏☆16Updated 4 years ago
- UAV offloading based on QMIX☆14Updated last year
- paddle cifar100 training☆14Updated 4 years ago
- shouyuantianxia / Algorithmic-Game-Theory-Application-on-Multi-agent-Combat-and-Verification-Platform-Design本科毕业设计:《多智能体博弈兵棋推演理论与验证平台设计》的源代码附录内容。强化学习算法的实现上参考了周沫凡先生的开源代码https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow☆57Updated 5 years ago
- Alignment成为GPT类大模型微调的必须环节,深度强化学习是Alignment的核心。本项目是一个支持非gym环境训练、支持可视化配置的深度强化学习应用编程框架,30分钟上手强化学习编程。☆74Updated 2 years ago
- 一个基于图神经网络的强化学习网络资源分配模型☆29Updated 3 years ago
- 可用于PaddlePaddle的RIFLE优化策略封装版,支持普通API与高阶API,并且只需向训练代码中插入一行代码即可使用RIFLE策略。☆21Updated 3 years ago
- Deep Q Network for Multi-agent RL☆15Updated 4 years ago
- 一些利用pytorch编程实现的强化学习例子☆36Updated 6 years ago
- qmix☆22Updated 5 years ago
- Deep Reinforcement Learning framework that uses GNN to solve planning tasks for infrastructural assets☆16Updated 3 years ago
- 基于PaddlePaddle的黑白影片色彩重建☆15Updated 4 years ago
- Dueling Double Deep Q Network with Prioritized Experience Replay Memory☆10Updated 2 years ago
- 使用Paddlehub实现最近大火的凡尔赛文案自动生成☆18Updated 3 years ago
- 在PyTorch上重构multi-agent deep deterministic policy gradient(MADDPG),将https://github.com/xuemei-ye/maddpg-mpe 修改到自己电脑上可运行。因为本人笔记本没有CUDA,实验速度…☆13Updated 6 years ago
- Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents☆8Updated 3 years ago
- 低代码序列数据处理框架,最短两行即可完成训练任务!☆12Updated 4 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆71Updated 3 years ago
- 车杆倒立摆DQN简单实现☆16Updated last year
- 通过python3.6编程,利用DQN算法实现机器学习避开障碍走到迷宫终点。(Through python3.6 programming, I use DQN algorithm to achieve machine learning and avoid obstacles…☆10Updated 7 years ago
- ☆15Updated 5 years ago
- D3QN implementation using pytorch☆15Updated 4 years ago
- Dueling DQN Pytorch☆13Updated 3 years ago
- ☆14Updated 4 years ago