BY571/SAC_discrete

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/BY571/SAC_discrete)

BY571 / SAC_discrete

PyTorch implementation of the discrete Soft-Actor-Critic algorithm.

☆55

Alternatives and similar repositories for SAC_discrete

Users that are interested in SAC_discrete are comparing it to the libraries listed below

Sorting:

Felhof / DiscreteSAC
View on GitHub
☆40Nov 17, 2021Updated 4 years ago
nisheeth-golakiya / hybrid-sac
View on GitHub
Single-file pytorch implementation of hybrid-SAC
☆65Jun 25, 2021Updated 4 years ago
mansicer / Q-Adapter
View on GitHub
Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"
☆18Oct 5, 2024Updated last year
BY571 / CQL
View on GitHub
PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…
☆147May 6, 2024Updated last year
typoverflow / WiseRL
View on GitHub
PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms
☆21Mar 24, 2025Updated 11 months ago
tianxusky / Code-for-Error-Bounds-of-Imitating-Policies-and-Environments
View on GitHub
☆10Oct 15, 2020Updated 5 years ago
XinJingHao / SAC-Discrete-Pytorch
View on GitHub
A clean and robust Pytorch implementation of SAC on discrete action space
☆42Oct 23, 2024Updated last year
apexrl / EBIL-torch
View on GitHub
Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>
☆11Oct 8, 2021Updated 4 years ago
Sirius2048 / DRL-TE
View on GitHub
Load balancing based on reinforcement learning.
☆11Oct 11, 2020Updated 5 years ago
ttumiel / minRLHF
View on GitHub
Minimal RLHF implementation built on top of minGPT.
☆32Jul 4, 2024Updated last year
ffelten / MASAC
View on GitHub
Jax and Torch Multi-Agent SAC on PettingZoo API
☆100Nov 23, 2024Updated last year
maywind23 / LSTM-RL
View on GitHub
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT…
☆18Oct 18, 2022Updated 3 years ago
XinJingHao / Actor-Sharer-Learner
View on GitHub
Actor-Sharer-Learner training framework for off-policy DRL algorithms
☆22Dec 29, 2024Updated last year
RyanNavillus / reward-surfaces
View on GitHub
☆19Apr 22, 2024Updated last year
LAMDA-RL / PRDC
View on GitHub
Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…
☆18Nov 8, 2024Updated last year
XinJingHao / Sparrow-V0
View on GitHub
A Reinforcement Learning Friendly Simulator for Mobile Robot
☆19Jan 5, 2025Updated last year
danielshin1 / oprl
View on GitHub
Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning
☆20Dec 30, 2022Updated 3 years ago
dennisl88 / rand_param_envs
View on GitHub
Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7
☆20Feb 14, 2019Updated 7 years ago
sherlockHSY / Reinforcement_learning_with_pytorch
View on GitHub
Implement some algorithms of RL
☆46Mar 28, 2023Updated 2 years ago
FanmingL / ESCP
View on GitHub
Code for Adapting Environment Sudden Changes by Learning Context Sensitive Policy
☆20Jun 1, 2022Updated 3 years ago
Tinker-Twins / AutoDRIVE-Coopertitive-MARL
View on GitHub
Multi-Agent Deep Reinforcement Learning for Cooperative and Competitive Autonomous Vehicles
☆26Aug 1, 2025Updated 7 months ago
S-Lab-System-Group / ChronusArtifact
View on GitHub
☆23Jan 7, 2022Updated 4 years ago
typoverflow / UtilsRL
View on GitHub
A python module designed for agile RL algorithm developing.
☆26Jul 11, 2024Updated last year
Linear95 / DSP
View on GitHub
Domain-specific preference (DSP) data and customized RM fine-tuning.
☆25Mar 7, 2024Updated last year
yobibyte / unitary-scalarization-dmtl
View on GitHub
In Defense of the Unitary Scalarization for Deep Multi-Task Learning
☆21Mar 8, 2023Updated 2 years ago
ammarhydr / SAC-Lagrangian
View on GitHub
PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm
☆61Jul 11, 2022Updated 3 years ago
keep9oing / DRQN-Pytorch-CartPole-v1
View on GitHub
Deep recurrent Q learning on CartPole-v1 environment
☆94Jan 15, 2024Updated 2 years ago
FanmingL / Recurrent-Offpolicy-RL
View on GitHub
Implementation of SAC and TD3 based on various RNN and Transformer.
☆28Sep 28, 2024Updated last year
sparkmxy / my-offlinerl
View on GitHub
☆26Jun 14, 2022Updated 3 years ago
facebookresearch / ede
View on GitHub
Code for the paper "Uncertainty-Driven Exploration for Generalization in Reinforcement Learning".
☆27Jul 6, 2023Updated 2 years ago
denisyarats / pytorch_sac
View on GitHub
PyTorch implementation of Soft Actor-Critic (SAC)
☆588Dec 5, 2021Updated 4 years ago
VD2410 / Multi-Agent-Path-Finding
View on GitHub
Implement a single- angle solver, namely space-time A*, and parts of three MAPF solvers, namely prioritized planning, Conflict-Based Sear…
☆26Apr 14, 2020Updated 5 years ago
indigoLovee / D3QN
View on GitHub
D3QN Pytorch
☆69Dec 13, 2021Updated 4 years ago
Zyxiv / -Ares2018
View on GitHub
☆11Sep 17, 2018Updated 7 years ago
liyc-ai / RL-pytorch
View on GitHub
A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.
☆26Jan 27, 2026Updated last month
chenf-ai / Multi-Agent-Communication-Considering-Representation-Learning
View on GitHub
☆30Dec 22, 2022Updated 3 years ago
aianaconda / PyTorch_BERT_NLP_BOOK
View on GitHub
☆29Dec 10, 2021Updated 4 years ago
Metro1998 / hppo-in-traffic-signal-control
View on GitHub
☆71May 9, 2024Updated last year
Lizhi-sjtu / DRL-code-pytorch
View on GitHub
Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.
☆1,441Mar 29, 2023Updated 2 years ago