ycz0512 / SAC-HERLinks
Implementation of Soft Actor-Critic with Hindsight Experience Replay
☆19Updated 4 years ago
Alternatives and similar repositories for SAC-HER
Users that are interested in SAC-HER are comparing it to the libraries listed below
Sorting:
- ☆17Updated last year
- An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer☆24Updated 6 years ago
- reinforcement learning algorithm for mapless navigation☆69Updated 4 years ago
- Intelligent control algorithm and simulation environment.☆17Updated 5 years ago
- ☆105Updated 2 months ago
- multi-turtlebot3 collision avoidance and navigation via DDPG-LSTM with Prioritized Experience Replay on ROS☆76Updated 3 years ago
- multi-agent formation control environment implemented with MPE.☆14Updated 3 years ago
- ☆154Updated 6 years ago
- Model-Free Safe Reinforcement Learning through Neural Barrier Certificate☆42Updated last year
- ☆12Updated 4 years ago
- 深度强化学习各算法介绍与Pytorch实现☆68Updated last year
- Turtlebot Robot Navigation in Gazebo based on DRL☆21Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆53Updated 7 months ago
- End to end motion planner using Deep Deterministic Policy Gradient (DDPG) in gazebo☆234Updated 2 years ago
- ☆12Updated 2 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆91Updated 2 years ago
- ☆23Updated 2 years ago
- Exploring the performance of Prioritized Experience Replay (PER) with the DDPG+HER scheme on the Fetch Robotics Environemnt☆14Updated 4 years ago
- A simple and fast 2D RL environment with obstacles to learn navigation.☆21Updated 6 years ago
- ☆23Updated 6 years ago
- DRL_Navigation_Robot_ROS2_Foxy☆34Updated last year
- a clean and robust Pytorch implementation of SAC on continuous action space☆86Updated 5 months ago
- 在turtlebot3,pytorch上使用DQN,DDPG,PPO,SAC算法,在gazebo上实现仿真。Use DQN, DDPG, PPO, SAC algorithm on turtlebot3, pytorch on turtlebot3, pytorch, an…☆120Updated last year
- Official Github Repository for "Trust Region-Based Safe Distributional Reinforcement Learning for Multiple Constraints". (NeurIPS 2023)☆19Updated 10 months ago
- Using RL based controller for steering function of RRT. And a network learns to perform motion planning☆34Updated 5 years ago
- The implementation of LSTM-TD3.☆85Updated 2 years ago
- ☆54Updated 3 years ago
- Predicting path with preference based on user demonstration using Maximum Entropy Deep Inverse Reinforcement Learning in a continuous env…☆24Updated 3 years ago
- BipedalWalker & BipedalWalkerHardcore solved by SAC☆25Updated last year
- ☆13Updated last year