a clean and robust Pytorch implementation of SAC on continuous action space
☆91Apr 13, 2025Updated 10 months ago
Alternatives and similar repositories for SAC-Continuous-Pytorch
Users that are interested in SAC-Continuous-Pytorch are comparing it to the libraries listed below
Sorting:
- A clean and robust Pytorch implementation of TD3 on continuous action space☆30Jun 8, 2024Updated last year
- ☆22Jan 14, 2023Updated 3 years ago
- A clean and robust Pytorch implementation of SAC on discrete action space☆42Oct 23, 2024Updated last year
- Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).☆15Feb 21, 2021Updated 5 years ago
- Intrinsic Curiosity Module (ICM) + PPO on the Pyramid and PushBlock environment.☆12Sep 3, 2019Updated 6 years ago
- A clean and robust Pytorch implementation of PPO on Discrete action space☆72Jun 8, 2024Updated last year
- Multi-Agent Reinforcement Learning (MARL) method to learn scalable control polices for multi-agent target tracking (IROS22).☆11Jul 22, 2022Updated 3 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆170Jun 8, 2024Updated last year
- Lithium-ion Battery Single Particle Model with Electrolyte and thermal dynamics, including degradation modes.☆17Jun 16, 2023Updated 2 years ago
- Using diffusion model to reach controllable end-to-end driving with Carla simulation environment.☆28Mar 18, 2025Updated 11 months ago
- AI Studio x vLoong能源AI挑战赛——异常检测赛 A榜 TOP11 B榜 TOP8 方案☆15May 21, 2023Updated 2 years ago
- implementation of MADDPG using PettingZoo and PyTorch☆165Nov 1, 2023Updated 2 years ago
- Actor-Sharer-Learner training framework for off-policy DRL algorithms☆22Dec 29, 2024Updated last year
- ☆20May 29, 2023Updated 2 years ago
- ☆20Oct 28, 2022Updated 3 years ago
- PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT…☆16Nov 18, 2020Updated 5 years ago
- Official open-source implementation of ICML 2022 paper: Reachability Constrainted Reinforcement Learning.☆40Jul 28, 2022Updated 3 years ago
- A Reinforcement Learning Friendly Simulator for Mobile Robot☆19Jan 5, 2025Updated last year
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆433Dec 1, 2025Updated 3 months ago
- An OpenAIGym-based framework allowing to test Delay-Aware Deep Reinforcement Learning algorithms for cooperative multi-UAV systems in ful…☆57Sep 25, 2025Updated 5 months ago
- Solve BipedalWalkerHardcore-v2 with TD3☆97May 21, 2023Updated 2 years ago
- Life cycle prediction model for batteries☆27Mar 4, 2020Updated 6 years ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆105Jun 9, 2020Updated 5 years ago
- ☆26Sep 29, 2021Updated 4 years ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆28Jul 24, 2023Updated 2 years ago
- ☆15May 20, 2025Updated 9 months ago
- Battery model with simple resistance and thevenin first order equivalent circuit approach☆10Dec 5, 2021Updated 4 years ago
- ☆10Dec 10, 2021Updated 4 years ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards☆32Jan 19, 2023Updated 3 years ago
- Using deep deterministic policy gradients to control a tiltrotor UAV through its transition in continuous state space☆38Nov 6, 2019Updated 6 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆36Sep 19, 2021Updated 4 years ago
- The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta…☆158Jul 10, 2024Updated last year
- Deep Reinforcement Learning based Autonomous Driving Agents☆10Jul 7, 2022Updated 3 years ago
- Project creates BEV image of the given multi sensor data from CARLA simulator.☆12Oct 23, 2023Updated 2 years ago
- ☆11Jun 15, 2019Updated 6 years ago
- GAN: An example for generating Gaussian distribution by a simple generating adversarial network.☆12Dec 28, 2020Updated 5 years ago
- The Python code of a deep-unfolding algorithm for weighted sum rate maximization (WSRMax) precoding design in multiuser MIMO systems☆12Apr 25, 2023Updated 2 years ago
- ☆11May 1, 2019Updated 6 years ago
- Implementation of Continuous Control RL Algorithms