a clean and robust Pytorch implementation of SAC on continuous action space
☆94Apr 13, 2025Updated last year
Alternatives and similar repositories for SAC-Continuous-Pytorch
Users that are interested in SAC-Continuous-Pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A clean and robust Pytorch implementation of TD3 on continuous action space☆31Jun 8, 2024Updated last year
- ☆24Jan 14, 2023Updated 3 years ago
- Intrinsic Curiosity Module (ICM) + PPO on the Pyramid and PushBlock environment.☆12Sep 3, 2019Updated 6 years ago
- ☆35Jun 16, 2023Updated 2 years ago
- A clean and robust Pytorch implementation of PPO on Discrete action space☆72Jun 8, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- implementation of MADDPG using PettingZoo and PyTorch☆168Nov 1, 2023Updated 2 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆174Jun 8, 2024Updated last year
- Actor-Sharer-Learner training framework for off-policy DRL algorithms☆22Dec 29, 2024Updated last year
- ☆21May 29, 2023Updated 2 years ago
- ☆20May 7, 2023Updated 2 years ago
- Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C5…☆3,376Jun 11, 2025Updated 10 months ago
- Official open-source implementation of ICML 2022 paper: Reachability Constrainted Reinforcement Learning.☆42Jul 28, 2022Updated 3 years ago
- A Reinforcement Learning Friendly Simulator for Mobile Robot☆19Jan 5, 2025Updated last year
- UAV trajectory design (DQN)☆22Oct 5, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code accompanying the paper Hovell, K., Ulrich, S., and Bronz, M., “Acceleration-based Quadrotor Guidance Under Time Delays Using Deep Re…☆13Dec 1, 2020Updated 5 years ago
- Implement reinforcement learning algorithms in Pytorch☆34Jun 7, 2021Updated 4 years ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆106Jun 9, 2020Updated 5 years ago
- Active Learning with Partial Feedback, ICLR 2019☆11Apr 27, 2020Updated 6 years ago
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆441Dec 1, 2025Updated 5 months ago
- Solve BipedalWalkerHardcore-v2 with TD3☆99May 21, 2023Updated 2 years ago
- An OpenAIGym-based framework allowing to test Delay-Aware Deep Reinforcement Learning algorithms for cooperative multi-UAV systems in ful…☆57Sep 25, 2025Updated 7 months ago
- Using diffusion model to reach controllable end-to-end driving with Carla simulation environment.☆29Mar 18, 2025Updated last year
- Constrained Policy Optimization implementation on Safety Gym☆30Jan 8, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 探索深度强化学习在自动驾驶决策规划中的使用☆25Nov 25, 2022Updated 3 years ago
- Multi-Agent Deep Recurrent Q-Learning with Bayesian epsilon-greedy on AirSim simulator☆13Apr 1, 2022Updated 4 years ago
- planning trajectories for UAVs☆12Mar 21, 2021Updated 5 years ago
- ☆26Sep 29, 2021Updated 4 years ago
- Code base for publication: Reinforcement Learning Approach for Multi-Agent Flexible Scheduling Problems☆10Feb 1, 2023Updated 3 years ago
- ☆10Jul 13, 2023Updated 2 years ago
- ☆12Mar 15, 2022Updated 4 years ago
- [NeurIPS 2025] TrajAgent: An LLM-Agent Framework for Trajectory Modeling via Large-and-Small Model Collaboration☆24Nov 30, 2025Updated 5 months ago
- ☆20Oct 28, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- CarND Capstone☆10Apr 2, 2018Updated 8 years ago
- Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).☆15Feb 21, 2021Updated 5 years ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆28Jul 24, 2023Updated 2 years ago
- Official implementation of "Graph Neural Network Reinforcement Learning for Autonomous Mobility-on-Demand☆83Apr 28, 2021Updated 5 years ago
- Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.☆1,473Mar 29, 2023Updated 3 years ago
- Neural Laplace Control for Continuous-time Delayed Systems - an offline RL method combining Neural Laplace dynamics model and MPC planner…☆16Apr 26, 2023Updated 3 years ago
- UAV PATH TRACKING AND DYNAMIC AVOIDANCE BASED ON ADS-B AND DEEP REINFORCEMENT LEARNING for Univerisity of Bristol RP3 final☆12Apr 18, 2023Updated 3 years ago