acezsq / dsx-rl
动手学强化学习代码
☆54Updated last year
Alternatives and similar repositories for dsx-rl:
Users that are interested in dsx-rl are comparing it to the libraries listed below
- TD3 in Pytorch☆33Updated 3 years ago
- 基于gym的pytorch深度强化学习(DRL)(PPO,PPG,DQN,SAC,DDPG,TD3等算法)☆102Updated last month
- 深度强化学习各算法介绍与Pytorch实现☆53Updated 9 months ago
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆99Updated 2 years ago
- Simple and efficient implementation of DQN DDPG TD3 SAC PPO MADDPG MATD3 MASAC MAAC IPPO MAPPO HAPPO MAT MORL☆62Updated 2 weeks ago
- RL algorithms☆141Updated 4 years ago
- 多智能体强化学习VDN、QMIX、QTRAN、QPLEX复现☆32Updated 2 years ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆73Updated 3 weeks ago
- implementation of MADDPG using PettingZoo and PyTorch☆139Updated last year
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆57Updated 3 years ago
- Projects from basic algorithms to MARL. Implements MADDPG/MATD3 in Predator-Prey pursuit games with PettingZoo MPE environments.☆62Updated last week
- 多智能体强化学习(MARL)算法复现,包括QMIX,VDN,QTRAN、MAVEN等等☆196Updated 2 years ago
- ☆63Updated last year
- Reinforcement learning with PyTorch, inspired by MorvanZhou, change the framework from Tensorflow to PyTorch☆288Updated 5 years ago
- Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning.☆48Updated 2 years ago
- Code for running RL experiments on continuing (non-episodic) problems.☆17Updated this week
- A Collection of Multi-Agent Reinforcement Learning (MARL) Resources☆232Updated 2 years ago
- 强化学习算法库,包含了目前主流的强化学习算法(Value based and Policy based)的代码,代码都经过调试并可以运行☆79Updated last year
- Reinforcement learning☆30Updated this week
- This is a personal library that strives to implement various MARL algorithms. The environment only integrates MPE, and the algorithm curr…☆15Updated 2 years ago
- Implement some algorithms of RL☆47Updated 2 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆88Updated last year
- Jax and Torch Multi-Agent SAC on PettingZoo API☆80Updated 5 months ago
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆337Updated last month
- ☆102Updated 2 months ago
- Use Multi-agent Twin Delayed Deep Deterministic Policy Gradient(TD3) algorithm to find reasonable paths for ships☆62Updated 2 years ago
- ☆62Updated 2 years ago
- ☆59Updated 3 months ago
- demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention…☆56Updated 3 years ago
- 多智能体强化学习☆92Updated 6 years ago