NoneJou072 / rl-notebookLinks
深度强化学习各算法介绍与Pytorch实现
☆68Updated last year
Alternatives and similar repositories for rl-notebook
Users that are interested in rl-notebook are comparing it to the libraries listed below
Sorting:
- ☆105Updated 2 months ago
- reinforcement learning algorithm for mapless navigation☆69Updated 4 years ago
- The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta…☆156Updated last year
- ☆87Updated 2 months ago
- ☆52Updated 3 months ago
- PPO, DDPG, SAC implementation on mujoco environment☆117Updated 3 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆91Updated 2 years ago
- Implementation of Soft Actor-Critic with Hindsight Experience Replay☆19Updated 4 years ago
- Exploring the performance of Prioritized Experience Replay (PER) with the DDPG+HER scheme on the Fetch Robotics Environemnt☆14Updated 4 years ago
- A Reinforcement Learning Project using PPO + LSTM☆93Updated 2 years ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆84Updated 5 months ago
- Intelligent control algorithm and simulation environment.☆17Updated 5 years ago
- Code for the paper "Practical Probabilistic Model-based Deep Reinforcement Learning by Integrating Dropout Uncertainty and Trajectory Sam…☆13Updated 10 months ago
- NeurIPS 2024 DACER☆138Updated last month
- Pytorch implementations of various Deep Reinforcement Learning algorithms on pybullet environments.☆30Updated 3 years ago
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆396Updated 2 months ago
- DDPGfD: This is our implementation project for the Reinforcement Learning course in NCTU.☆34Updated 3 years ago
- ☆59Updated 2 months ago
- ☆16Updated 3 years ago
- ☆111Updated 2 years ago
- SAC, PPO, A2C implementation on Mujoco environments : Humanoid-v4, Ant-v4, Cheetah-v4 . Includes reward manipulation.☆28Updated 2 weeks ago
- Robot arm control using reinforcement learning algorithms : DDPG and TD3 with hindsight experience replay (HER)☆79Updated last year
- TD3 in Pytorch☆35Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆52Updated 6 months ago
- 动手学强化学习代码☆60Updated last year
- Official implementation for the UOF paper (algorithm & environment)☆33Updated 2 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆112Updated 4 years ago
- A implementation for soving reach target task based on TD3 with HER using PaddlePaddle.☆12Updated 5 years ago
- Source Code☆204Updated last year
- Transformer in RL for decision-making☆100Updated 2 years ago