NoneJou072 / rl-notebook
Introductions to deep reinforcement learning algorithms, with PyTorch implementations
☆49Updated 8 months ago
Alternatives and similar repositories for rl-notebook:
Users who are interested in rl-notebook are comparing it to the libraries listed below.
- ☆102Updated last month
- A clean and robust PyTorch implementation of SAC on continuous action space☆70Updated 9 months ago
- Solve BipedalWalkerHardcore-v2 with TD3☆87Updated last year
- TD3 in PyTorch☆31Updated 3 years ago
- ☆23Updated 2 years ago
- rl-papers☆47Updated 2 years ago
- The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta…☆151Updated 8 months ago
- ☆59Updated 2 months ago
- This is a reinforcement learning algorithm library. The code balances performance and simplicity, with few dependencies.☆98Updated 2 years ago
- Code for "Hands-on Reinforcement Learning" (动手学强化学习)☆52Updated last year
- ☆38Updated this week
- Reinforcement learning algorithm for mapless navigation☆68Updated 3 years ago
- Implementation of Soft Actor-Critic with Hindsight Experience Replay☆16Updated 4 years ago
- A Reinforcement Learning Project using PPO + LSTM☆64Updated last year
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆49Updated last month
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆123Updated 11 months ago
- Adversarial Reinforcement Learning papers (single-agent setting and multi-agent setting)☆70Updated 2 years ago
- Transformer in RL for decision-making☆97Updated 2 years ago
- PyTorch implementations of MADDPG, MAPPO (coming)☆131Updated last year
- Intelligent control algorithm and simulation environment.☆16Updated 5 years ago
- Implementations of reinforcement learning algorithms in PyTorch☆33Updated 3 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆106Updated 3 years ago
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆309Updated 2 weeks ago
- DSAC; Distributional Soft Actor-Critic☆125Updated last month
- Code for running RL experiments on continuing (non-episodic) problems.☆17Updated 2 weeks ago
- ☆16Updated 2 years ago
- PPO, DDPG, and SAC implementations in MuJoCo environments☆103Updated 3 years ago
- Implementation of MADDPG using PettingZoo and PyTorch☆135Updated last year
- Uses the multi-agent Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm to find reasonable paths for ships☆61Updated 2 years ago
- A reinforcement learning algorithm library containing code for the current mainstream algorithms (value-based and policy-based); all code has been debugged and runs☆72Updated last year