alirezakazemipour / SACLinks
Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.
☆29Updated 7 months ago
Alternatives and similar repositories for SAC
Users that are interested in SAC are comparing it to the libraries listed below
Sorting:
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆90Updated 2 years ago
- DSAC; Distributional Soft Actor-Critic☆134Updated 10 months ago
- Distributional Soft Actor Critic☆59Updated 5 years ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆105Updated 5 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆112Updated 4 years ago
- Pytorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.]☆74Updated 6 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆156Updated last year
- DecentralizedLearning☆24Updated 3 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆93Updated last year
- Implementation of PPO Lagrangian in PyTorch☆54Updated 3 years ago
- Transformer in RL for decision-making☆103Updated 2 years ago
- [ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control☆124Updated 5 years ago
- 🚀 A fast safe reinforcement learning library in PyTorch☆224Updated last year
- Implementing the two pioneering IRL papers "Algorithms for Inverse Reinforcement Learning" - (Ng &Russell 2000) and "Maximum Entropy Inve…☆31Updated 2 years ago
- Model Predictive Actor-Critic Reinforcement Learning☆68Updated 4 years ago
- ☆67Updated 5 months ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆62Updated 2 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆176Updated last year
- PyTorch implementation of DDPG for continuous control tasks.☆46Updated 5 years ago
- ☆27Updated last year
- Deep recurrent Q learning on CartPole-v1 environment☆96Updated last year
- Evolution-based Soft Actor-Critic (ESAC)☆42Updated last year
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆70Updated last year
- ☆55Updated 6 months ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆52Updated 7 months ago
- This is the official repository for the paper "Guided Exploration with Proximal Policy Optimization using a Single Demonstration", https:…☆19Updated 4 years ago
- 这是一个关于基于模型的强化学习的资料,包括一些代码地址、paper、slide等。☆45Updated 5 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆198Updated last year
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆143Updated last year
- Safe Reinforcement Learning in Constrained Markov Decision Processes☆61Updated 5 years ago