a clean and robust Pytorch implementation of SAC on continuous action space
☆94Apr 13, 2025Updated last year
Alternatives and similar repositories for SAC-Continuous-Pytorch
Users that are interested in SAC-Continuous-Pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A clean and robust Pytorch implementation of TD3 on continuous action space☆31Jun 8, 2024Updated last year
- ☆24Jan 14, 2023Updated 3 years ago
- Intrinsic Curiosity Module (ICM) + PPO on the Pyramid and PushBlock environment.☆12Sep 3, 2019Updated 6 years ago
- A clean and robust Pytorch implementation of SAC on discrete action space☆43Oct 23, 2024Updated last year
- ☆35Jun 16, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A clean and robust Pytorch implementation of PPO on continuous action space.☆173Jun 8, 2024Updated last year
- implementation of MADDPG using PettingZoo and PyTorch☆167Nov 1, 2023Updated 2 years ago
- Actor-Sharer-Learner training framework for off-policy DRL algorithms☆22Dec 29, 2024Updated last year
- ☆19May 7, 2023Updated 2 years ago
- Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C5…☆3,344Jun 11, 2025Updated 10 months ago
- Official open-source implementation of ICML 2022 paper: Reachability Constrainted Reinforcement Learning.☆41Jul 28, 2022Updated 3 years ago
- A Reinforcement Learning Friendly Simulator for Mobile Robot☆19Jan 5, 2025Updated last year
- UAV trajectory design (DQN)☆22Oct 5, 2021Updated 4 years ago
- Code accompanying the paper Hovell, K., Ulrich, S., and Bronz, M., “Acceleration-based Quadrotor Guidance Under Time Delays Using Deep Re…☆13Dec 1, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Implement reinforcement learning algorithms in Pytorch☆34Jun 7, 2021Updated 4 years ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆106Jun 9, 2020Updated 5 years ago
- Multi-Agent Reinforcement Learning (MARL) method to learn scalable control polices for multi-agent target tracking (IROS22).☆11Jul 22, 2022Updated 3 years ago
- Trajectory Optimization and Computing Offloading Strategy in UAV-Assisted MEC System☆191Jul 11, 2022Updated 3 years ago
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆438Dec 1, 2025Updated 4 months ago
- Solve BipedalWalkerHardcore-v2 with TD3☆99May 21, 2023Updated 2 years ago
- An OpenAIGym-based framework allowing to test Delay-Aware Deep Reinforcement Learning algorithms for cooperative multi-UAV systems in ful…☆58Sep 25, 2025Updated 6 months ago
- Using diffusion model to reach controllable end-to-end driving with Carla simulation environment.☆29Mar 18, 2025Updated last year
- Constrained Policy Optimization implementation on Safety Gym☆30Jan 8, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Lithium-ion Battery Single Particle Model with Electrolyte and thermal dynamics, including degradation modes.☆17Jun 16, 2023Updated 2 years ago
- Multi-Agent Deep Recurrent Q-Learning with Bayesian epsilon-greedy on AirSim simulator☆13Apr 1, 2022Updated 4 years ago
- ☆26Sep 29, 2021Updated 4 years ago
- PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT…☆16Nov 18, 2020Updated 5 years ago
- ☆10Jul 13, 2023Updated 2 years ago
- ☆12Mar 15, 2022Updated 4 years ago
- [NeurIPS 2025] TrajAgent: An LLM-Agent Framework for Trajectory Modeling via Large-and-Small Model Collaboration☆21Nov 30, 2025Updated 4 months ago
- ☆20Oct 28, 2022Updated 3 years ago
- CarND Capstone☆10Apr 2, 2018Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).☆15Feb 21, 2021Updated 5 years ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆28Jul 24, 2023Updated 2 years ago
- A CPP Console application that uses ftxui, and find the path between two points using diferent algoritms.☆14Jan 28, 2025Updated last year
- Official implementation of "Graph Neural Network Reinforcement Learning for Autonomous Mobility-on-Demand