NoneJou072 / rl-notebookLinks
深度强化学习各算法介绍与Pytorch实现
☆54Updated 10 months ago
Alternatives and similar repositories for rl-notebook
Users that are interested in rl-notebook are comparing it to the libraries listed below
Sorting:
- a clean and robust Pytorch implementation of SAC on continuous action space☆77Updated last month
- TD3 in Pytorch☆34Updated 3 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆89Updated 2 years ago
- reinforcement learning algorithm for mapless navigation☆68Updated 4 years ago
- ☆49Updated 2 months ago
- ☆103Updated 3 months ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆51Updated 3 months ago
- A Reinforcement Learning Project using PPO + LSTM☆80Updated last year
- 动手学强化学习代码☆57Updated last year
- ☆24Updated 2 years ago
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆100Updated 2 years ago
- ☆63Updated 3 weeks ago
- Implementation of Soft Actor-Critic with Hindsight Experience Replay☆17Updated 4 years ago
- Exploring the performance of Prioritized Experience Replay (PER) with the DDPG+HER scheme on the Fetch Robotics Environemnt☆13Updated 3 years ago
- The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta…☆150Updated 10 months ago
- Transformer in RL for decision-making☆97Updated 2 years ago
- implementation of MADDPG using PettingZoo and PyTorch☆144Updated last year
- ☆23Updated 2 years ago
- multi-agent formation control environment implemented with MPE.☆14Updated 3 years ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆137Updated last year
- multi-turtlebot3 collision avoidance and navigation via DDPG-LSTM with Prioritized Experience Replay on ROS☆76Updated 2 years ago
- 强化学习算法库,包含了目前主流的强化学习算法(Value based and Policy based)的代码,代码都经过调试并可以运行☆85Updated last year
- Use Multi-agent Twin Delayed Deep Deterministic Policy Gradient(TD3) algorithm to find reasonable paths for ships☆64Updated 2 years ago
- Multi Agent SAC and DDPG applied to path finding in a 3-dimensional grid☆12Updated 3 years ago
- The implementation of LSTM-TD3.☆81Updated 2 years ago
- Code for running RL experiments on continuing (non-episodic) problems.☆17Updated 2 weeks ago
- PyTorch implementations of MADDPG, MAPPO (coming)☆147Updated last year
- 无人机动态覆盖控制;1. 实现了一个无人机点覆盖环境;2. 给出了无人机连通保持规则;3. 给出了基于MARL的控制算法☆49Updated 9 months ago
- rl-papers☆47Updated 2 years ago
- A clean and robust Pytorch implementation of SAC on discrete action space☆39Updated 7 months ago