PyTorch implementation of the discrete Soft-Actor-Critic algorithm.
☆58Oct 1, 2021Updated 4 years ago
Alternatives and similar repositories for SAC_discrete
Users that are interested in SAC_discrete are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆40Nov 17, 2021Updated 4 years ago
- PyTorch implementation of SAC-Discrete.☆317Jul 25, 2024Updated last year
- PyTorch implementation of discrete version of Soft Actor-Critic.☆37Sep 19, 2021Updated 4 years ago
- Single-file pytorch implementation of hybrid-SAC☆68Jun 25, 2021Updated 5 years ago
- Re-produce DQN, REINFORCE, REINFORCE with baseline, one-step AC, QAC, QAC with shared network, PPO2, DDPG, TD3, SAC, SAC discrete,A2C,A3C☆21Jul 27, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A clean and robust Pytorch implementation of SAC on discrete action space☆43Oct 23, 2024Updated last year
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆146May 6, 2024Updated 2 years ago
- Jax and Torch Multi-Agent SAC on PettingZoo API☆101Nov 23, 2024Updated last year
- Load balancing based on reinforcement learning.☆11Oct 11, 2020Updated 5 years ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- Color: Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity☆22Dec 23, 2024Updated last year
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Mar 24, 2025Updated last year
- ☆10Oct 15, 2020Updated 5 years ago
- Code for experimenting with load-balancing intradomain traffic engineering using GNNs and RL. Project as part of masters degree at the Un…☆38Jan 12, 2021Updated 5 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Minimal RLHF implementation built on top of minGPT.☆32Jul 4, 2024Updated 2 years ago
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>☆12Oct 8, 2021Updated 4 years ago
- Reference code for the paper ""Centroid-Guided Target-Driven Topology Control Method for UAV Ad-Hoc Networks Based on Tiny Deep Reinforce…☆13Oct 21, 2024Updated last year
- PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm☆63Jul 11, 2022Updated 3 years ago
- Multi-Agent training using Deep Deterministic Policy Gradient Networks, Solving the Tennis Environment☆11Oct 20, 2018Updated 7 years ago
- PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT…☆18Oct 18, 2022Updated 3 years ago
- Actor-Sharer-Learner training framework for off-policy DRL algorithms☆22Dec 29, 2024Updated last year
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Feb 14, 2019Updated 7 years ago
- BranchingDQN☆51Jan 30, 2019Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Curiosity-driven Exploration by Self-supervised Prediction☆147Mar 12, 2023Updated 3 years ago
- A Reinforcement Learning Friendly Simulator for Mobile Robot☆18Jan 5, 2025Updated last year
- "Adaptive Cruise Control for a Hybrid Vehicle with Deep Policy Gradients". Final project for ECE 517/414 Reinforcement Learning.☆13Dec 8, 2021Updated 4 years ago
- Code Repository for the NeurIPS 2022 paper: "Hyper-Representations as Generative Models: Sampling Unseen Neural Network Weights".☆18Jul 10, 2024Updated last year
- D3QN Pytorch☆70Dec 13, 2021Updated 4 years ago
- Implementation of Dueling Network Architectures for Deep Reinforcement Learning paper with Pytorch☆14Sep 26, 2020Updated 5 years ago
- ☆16Aug 26, 2025Updated 10 months ago
- Code for Adapting Environment Sudden Changes by Learning Context Sensitive Policy☆21Jun 1, 2022Updated 4 years ago
- Various explorations into the game of Poker using MCTS, NFSP, and image-recognition/web-scraping☆13Oct 23, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Flatland Multi Agent Reinforcement Learning☆16Aug 1, 2020Updated 5 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆95Jan 15, 2024Updated 2 years ago
- An environment based on JSBSIM aimed at one-to-one close air combat.☆20Sep 14, 2025Updated 9 months ago
- Based on NS-3, design new GPSR routing protocols.☆13May 27, 2018Updated 8 years ago
- Multi Agent SAC and DDPG applied to path finding in a 3-dimensional grid☆15Aug 8, 2021Updated 4 years ago
- code for☆11Apr 10, 2021Updated 5 years ago
- PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT…☆1,347Mar 13, 2025Updated last year