alirezakazemipour / SAC

Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.

☆28

Alternatives and similar repositories for SAC

Users who are interested in SAC are comparing it to the repositories listed below.

zhihanyang2022 / off-policy-continuous-control
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
☆86 Updated last year
Jingliang-Duan / DSAC-v1
DSAC; Distributional Soft Actor-Critic
☆129 Updated 5 months ago
watakandai / hiro_pytorch
Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)
☆110 Updated 4 years ago
akjayant / PPO_Lagrangian_PyTorch
Implementation of PPO Lagrangian in PyTorch
☆49 Updated 2 years ago
toshikwa / soft-actor-critic.pytorch
PyTorch implementation of Soft Actor-Critic (SAC).
☆103 Updated 5 years ago
watchernyu / REDQ
Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.
☆170 Updated 8 months ago
xtma / dsac
Distributional Soft Actor Critic
☆58 Updated 5 years ago
datvodinh / ppo-transformer
A Reinforcement Learning Project using PPO + Transformer
☆59 Updated last year
LinghengMeng / LSTM-TD3
The implementation of LSTM-TD3.
☆81 Updated 2 years ago
ZhongZ-Wang / Model-Based-RL
A collection of resources on model-based reinforcement learning, including links to code, papers, slides, and more.
☆44 Updated 4 years ago
MarcoMeter / recurrent-ppo-truncated-bptt
Baseline implementation of recurrent PPO using truncated BPTT
☆149 Updated last year
baimingc / delay-aware-MBRL
Code for the paper "Delay-Aware Model-Based Reinforcement Learning for Continuous Control".
☆27 Updated 5 years ago
schneimo / ddpg-pytorch
PyTorch implementation of DDPG for continuous control tasks.
☆46 Updated 5 years ago
mit-gfx / PGMORL
[ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control
☆119 Updated 4 years ago
williamyuanv0 / Transformer-in-Reinforcement-Learning-for-Decision-Making-A-Survey
Transformer in RL for decision-making
☆96 Updated 2 years ago
keep9oing / DRQN-Pytorch-CartPole-v1
Deep recurrent Q learning on CartPole-v1 environment
☆91 Updated last year
hcnoh / gail-pytorch
A simple implementation of Generative Adversarial Imitation Learning with PyTorch
☆163 Updated 3 years ago
schatty / oprl
A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing
☆134 Updated this week
kantologist / multiagent-sac
Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multiagent Unity environment.
☆37 Updated 4 years ago
Johnny-Zhang92 / IRL-Essential-Code
Generate expert demonstrations; GAIL (Generative Adversarial Imitation Learning); IRL (Inverse Reinforcement Learning)
☆33 Updated 3 years ago
liuzuxin / FSRL
🚀 A fast safe reinforcement learning library in PyTorch
☆202 Updated 9 months ago
BY571 / SAC_discrete
PyTorch implementation of the discrete Soft-Actor-Critic algorithm.
☆53 Updated 3 years ago
BY571 / Soft-Actor-Critic-and-Extensions
PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…
☆288 Updated 4 years ago
Dragon-Zhuang / BPPO
Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO).
☆86 Updated last year
XinJingHao / PPO-Continuous-Pytorch
A clean and robust PyTorch implementation of PPO on continuous action space.
☆156 Updated last year
MyRepositories-hub / Simple-Policy-Optimization
☆66 Updated this week
Felhof / DiscreteSAC
☆40 Updated 3 years ago
LucasCJYSDL / HierAIRL
A novel Hierarchical Imitation Learning algorithm based on AIRL.
☆22 Updated 2 years ago
danielwillemsen / MAMBPO
DecentralizedLearning
☆24 Updated 2 years ago
montaserFath / BCO
behavior cloning from observation
☆35 Updated 4 years ago