NoneJou072 / rl-notebook
深度强化学习各算法介绍与Pytorch实现
☆54Updated 9 months ago
Alternatives and similar repositories for rl-notebook
Users that are interested in rl-notebook are comparing it to the libraries listed below
Sorting:
- reinforcement learning algorithm for mapless navigation☆68Updated 3 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆88Updated last year
- a clean and robust Pytorch implementation of SAC on continuous action space☆75Updated last month
- TD3 in Pytorch☆33Updated 3 years ago
- ☆46Updated last month
- ☆103Updated 3 months ago
- 动手学强化学习代码☆55Updated last year
- Implementation of Soft Actor-Critic with Hindsight Experience Replay☆17Updated 4 years ago
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆100Updated 2 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆51Updated 2 months ago
- The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta…☆150Updated 10 months ago
- ☆60Updated last week
- Code for running RL experiments on continuing (non-episodic) problems.☆17Updated last week
- DSAC; Distributional Soft Actor-Critic☆125Updated 3 months ago
- Exploring the performance of Prioritized Experience Replay (PER) with the DDPG+HER scheme on the Fetch Robotics Environemnt☆13Updated 3 years ago
- Intelligent control algorithm and simulation environment.☆17Updated 5 years ago
- Transformer in RL for decision-making☆97Updated 2 years ago
- rl-papers☆47Updated 2 years ago
- A Reinforcement Learning Project using PPO + LSTM☆76Updated last year
- ☆23Updated 2 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆145Updated 11 months ago
- 基于pytorch的强化学习2d机械臂小实验(DDPG算法)☆37Updated 6 years ago
- NeurIPS 2024 DACER☆106Updated last week
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆343Updated last month
- Implement reinforcement learning algorithms in Pytorch☆33Updated 3 years ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆128Updated last year
- ☆16Updated 2 years ago
- 使用PPO算法+OU噪声进行机械臂轨迹规划仿真☆17Updated last year
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆170Updated last year
- ☆23Updated 2 years ago