WangYuxiang8 / RL-demo
Implementations of common reinforcement learning algorithms: Q-Learning/DQN/PG/AC/DDPG/PPO/SAC
☆15Updated 2 years ago
Related projects
Alternatives and complementary repositories for RL-demo
- RL algorithms☆139Updated 3 years ago
- Multi-agent reinforcement learning☆83Updated 5 years ago
- Various reinforcement learning models in PyTorch☆273Updated last year
- Reinforcement learning with PyTorch, inspired by MorvanZhou; the framework is changed from TensorFlow to PyTorch☆255Updated 4 years ago
- ☆299Updated 4 months ago
- Truly minimal reinforcement learning (based on pfrl, a torch-based reinforcement learning framework)☆68Updated 2 years ago
- ☆57Updated 2 years ago
- This repository collects code that encapsulates commonly used algorithms in the field of machine learning. Most of them are based o…☆391Updated 2 months ago
- Implementations of some RL algorithms☆43Updated last year
- ☆572Updated last year
- ☆336Updated last week
- PyTorch implementation of multiple deep reinforcement learning algorithms (DQN, DDPG, TD3, PPO, A3C...) with OpenAI Gym☆53Updated 3 years ago
- This is a reinforcement learning algorithm library. The code balances performance and simplicity, with few dependencies.☆91Updated 2 years ago
- Reproductions of multi-agent reinforcement learning (MARL) algorithms, including QMIX, VDN, QTRAN, MAVEN, and more☆179Updated 2 years ago
- Tutorial for Reinforcement Learning☆172Updated 2 years ago
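- Digi-Metal / Reinforce-learning-based-algorithm-for-dynamic-scheduling-problem-in-steelmaking-workshop: Reinforcement-learning-based solution techniques and software implementation for dynamic scheduling in steelmaking☆16Updated 4 years ago
- Gym-based deep reinforcement learning (DRL) in PyTorch (PPO, PPG, DQN, SAC, DDPG, TD3, and other algorithms)☆64Updated 2 months ago
- Code for "Hands-on Reinforcement Learning" (动手学强化学习)☆35Updated 9 months ago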
- PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....☆51Updated 4 years ago
- basic algorithms of reinforcement learning☆195Updated last year
- DDPG in Pytorch☆37Updated 2 years ago
- Reinforcement Learning Algorithms Based on PyTorch☆17Updated 2 years ago
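- Multi-agent reinforcement learning algorithms based on bio-inspired heuristic algorithms☆13Updated 3 years ago
- Pointer network + reinforcement learning for solving the traveling salesman problem (TSP)☆75Updated 3 years ago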
- Deep Q Network for Multi-agent RL☆15Updated 4 years ago
- Dueling DQN Pytorch☆12Updated 2 years ago
- Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3, SAC.☆1,075Updated last year
- Multi-agent learning library☆15Updated 2 years ago
- TD3 in Pytorch☆27Updated 2 years ago