ac-93 / soft-actor-criticView external linksLinks
Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.
☆99Jun 22, 2020Updated 5 years ago
Alternatives and similar repositories for soft-actor-critic
Users that are interested in soft-actor-critic are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of SAC-Discrete.☆314Jul 25, 2024Updated last year
- Modified versions of the Soft Actor-Critic algorithm for Atari games from https://github.com/ac-93/soft-actor-critic.☆20May 18, 2020Updated 5 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)☆253May 3, 2020Updated 5 years ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆105Jun 9, 2020Updated 5 years ago
- Understanding RL vision Distill article☆25Mar 3, 2023Updated 2 years ago
- Prioritized Sequence Experience Replay☆10Aug 16, 2021Updated 4 years ago
- ☆10Sep 3, 2021Updated 4 years ago
- Revisiting Rainbow☆75Jun 9, 2021Updated 4 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Nov 15, 2018Updated 7 years ago
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"☆22Dec 17, 2019Updated 6 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆295Feb 24, 2021Updated 4 years ago
- PyTorch implementation of Soft Actor-Critic (SAC)☆586Dec 5, 2021Updated 4 years ago
- Example Code for the Conditional Action Trees Paper☆12May 24, 2021Updated 4 years ago
- Implementation of algorithms for continuous control (DDPG and NAF).☆313Feb 16, 2021Updated 4 years ago
- PyTorch implementation of soft actor critic☆933Jul 17, 2025Updated 6 months ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆51Dec 8, 2022Updated 3 years ago
- A2C training of Relational Deep Reinforcement Learning Architecture☆13Jun 22, 2022Updated 3 years ago
- Model-Based Uncertainty in Value Functions (AISTATS2023)☆16Feb 28, 2023Updated 2 years ago
- Fast Online Adaptation in Robotics through Meta-Learning Embeddings of Simulated Priors☆19Apr 14, 2021Updated 4 years ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Mar 14, 2021Updated 4 years ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆163Dec 21, 2021Updated 4 years ago
- PyTorch implementation of deep reinforcement learning algorithms☆491Nov 19, 2021Updated 4 years ago
- PyTorch implementation of SimPLe (Simulated Policy Learning) on the Atari 100k benchmark.☆16Dec 7, 2022Updated 3 years ago
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning☆15Jul 23, 2021Updated 4 years ago
- (CoRL 2019 Spotlight) Asynchronous Methods for Model-Based Reinforcement Learning☆14Dec 27, 2022Updated 3 years ago
- Reinforcement Leanring Algorithms Trained with Unity☆14Apr 26, 2019Updated 6 years ago
- Learning from Trajectories via Subgoal Discovery☆12Dec 10, 2020Updated 5 years ago
- Model-Based Reinforcement Learning via Latent-Space Collocation.☆34Mar 29, 2023Updated 2 years ago
- Lipschitz-constrained Unsupervised Skill Discovery (ICLR 2022)☆38Jun 3, 2023Updated 2 years ago
- ☆99Mar 24, 2023Updated 2 years ago
- Implicit Normalizing Flows + Reinforcement Learning☆62May 31, 2019Updated 6 years ago
- soft q learning and soft actor critic☆16Dec 23, 2018Updated 7 years ago
- The implementation of Discriminator Soft Actor Critic☆15Jan 25, 2020Updated 6 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆530Nov 22, 2022Updated 3 years ago
- ☆35Sep 5, 2020Updated 5 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆97Sep 3, 2020Updated 5 years ago
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…☆12Mar 17, 2021Updated 4 years ago
- Exploration based Reinforcement Learning. (Montezuma Revenge)☆14Jul 23, 2018Updated 7 years ago
- PyTorch implementation of Distribution Correction(DisCor) based on Soft Actor-Critic.☆38Jun 22, 2022Updated 3 years ago