Starlight0798 / gymRL
基于gym的pytorch深度强化学习(DRL)(PPO,PPG,DQN,SAC,DDPG,TD3等算法)
☆102Updated last month
Alternatives and similar repositories for gymRL:
Users that are interested in gymRL are comparing it to the libraries listed below
- 动手学 强化学习代码☆54Updated last year
- Reinforcement learning with PyTorch, inspired by MorvanZhou, change the framework from Tensorflow to PyTorch☆288Updated 5 years ago
- kinds of reinforcement learning model by Pytorch☆327Updated 2 years ago
- Projects from basic algorithms to MARL. Implements MADDPG/MATD3 in Predator-Prey pursuit games with PettingZoo MPE environments.☆62Updated last week
- ☆359Updated last week
- Simple and efficient implementation of DQN DDPG TD3 SAC PPO MADDPG MATD3 MASAC MAAC IPPO MAPPO HAPPO MAT MORL☆62Updated 2 weeks ago
- TD3 in Pytorch☆33Updated 3 years ago
- Multi-UAV target round-up based on MADDPG☆147Updated 2 months ago
- UAVGym是一个用python编写的GYM风格的无人机仿真环境,用于强化学习算法的研究。☆53Updated last year
- 多智能体强化学习(MARL)算法复现,包括QMIX,VDN,QTRAN、MAVEN等等☆196Updated 2 years ago
- ☆153Updated 2 months ago
- A Collection of Multi-Agent Reinforcement Learning (MARL) Resources☆232Updated 2 years ago
- implementation of MADDPG using PettingZoo and PyTorch☆139Updated last year
- RL algorithms☆141Updated 4 years ago
- ☆80Updated last year
- Lightweight version of MAPPO to help you quickly migrate to your local environment.☆663Updated 2 months ago
- ☆449Updated 6 months ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆73Updated 3 weeks ago
- 强化学习算法库,包含了目前主流的强化学习算法(Value based and Policy based)的代码,代码都经过调试并可以运行☆79Updated last year
- 深度强化学习各算法介绍与Pytorch实现☆53Updated 9 months ago
- PyTorch implementations of MADDPG, MAPPO (coming)☆139Updated last year
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆337Updated last month
- 多智能体强化学习VDN、QMIX、QTRAN、QPLEX复现☆32Updated 2 years ago
- Concise pytorch implements of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX and VDN.☆575Updated 2 years ago
- Code for paper "基于多智能体深度强化学习的车联网通信资源分配优化"☆268Updated last year
- Reinforcement learning☆30Updated this week
- ☆62Updated 2 years ago
- gym 框架下的多智能体追逃博弈强化学习平台☆15Updated last year
- Use Multi-agent Twin Delayed Deep Deterministic Policy Gradient(TD3) algorithm to find reasonable paths for ships☆62Updated 2 years ago
- 《动手学强化学习》练习代码(Pytorch)☆15Updated 2 years ago