Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.
☆99 · Jun 22, 2020 · Updated 5 years ago
Alternatives and similar repositories for soft-actor-critic
Users who are interested in soft-actor-critic are comparing it to the libraries listed below.
- PyTorch implementation of SAC-Discrete. ☆314 · Jul 25, 2024 · Updated last year
- Modified versions of the Soft Actor-Critic algorithm for Atari games from https://github.com/ac-93/soft-actor-critic. ☆20 · May 18, 2020 · Updated 5 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE) ☆254 · May 3, 2020 · Updated 5 years ago
- PyTorch implementation of Soft Actor-Critic(SAC). ☆105 · Jun 9, 2020 · Updated 5 years ago
- Understanding RL vision Distill article ☆25 · Mar 3, 2023 · Updated 3 years ago
- Prioritized Sequence Experience Replay ☆10 · Aug 16, 2021 · Updated 4 years ago
- ☆10 · Sep 3, 2021 · Updated 4 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a… ☆23 · Dec 22, 2020 · Updated 5 years ago
- Revisiting Rainbow ☆76 · Jun 9, 2021 · Updated 4 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm ☆44 · Nov 15, 2018 · Updated 7 years ago
- A python implementation for PILCO algorithm for a robotic arm - tested on mujoco robotics environment ☆12 · Jan 8, 2020 · Updated 6 years ago
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control" ☆22 · Dec 17, 2019 · Updated 6 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL… ☆295 · Feb 24, 2021 · Updated 5 years ago
- ☆148 · Dec 9, 2024 · Updated last year
- PyTorch implementation of Soft Actor-Critic (SAC) ☆588 · Dec 5, 2021 · Updated 4 years ago
- practice ☆10 · Jun 30, 2020 · Updated 5 years ago
- Example Code for the Conditional Action Trees Paper ☆12 · May 24, 2021 · Updated 4 years ago
- Implementation of algorithms for continuous control (DDPG and NAF). ☆313 · Feb 16, 2021 · Updated 5 years ago
- PyTorch implementation of soft actor critic ☆938 · Jul 17, 2025 · Updated 7 months ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322) ☆51 · Dec 8, 2022 · Updated 3 years ago
- Fast Online Adaptation in Robotics through Meta-Learning Embeddings of Simulated Priors ☆19 · Apr 14, 2021 · Updated 4 years ago
- Model-Based Uncertainty in Value Functions (AISTATS2023) ☆16 · Feb 28, 2023 · Updated 3 years ago
- PyTorch implementation of different Deep RL algorithms for the LunarLander-v2 environment in OpenAI Gym ☆11 · May 20, 2018 · Updated 7 years ago
- A2C training of Relational Deep Reinforcement Learning Architecture ☆13 · Jun 22, 2022 · Updated 3 years ago
- Experiments to train transformer network to master reinforcement learning environments. ☆32 · Mar 14, 2021 · Updated 4 years ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations" ☆162 · Dec 21, 2021 · Updated 4 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch. ☆29 · May 12, 2025 · Updated 9 months ago
- PyTorch implementation of deep reinforcement learning algorithms ☆491 · Nov 19, 2021 · Updated 4 years ago
- Reinforcement Learning Algorithms Trained with Unity ☆13 · Apr 26, 2019 · Updated 6 years ago
- Model-Free Episodic Control ☆14 · Jan 12, 2017 · Updated 9 years ago
- Learning from Trajectories via Subgoal Discovery ☆12 · Dec 10, 2020 · Updated 5 years ago
- (CoRL 2019 Spotlight) Asynchronous Methods for Model-Based Reinforcement Learning ☆14 · Dec 27, 2022 · Updated 3 years ago
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning ☆15 · Jul 23, 2021 · Updated 4 years ago
- Model-Based Reinforcement Learning via Latent-Space Collocation. ☆34 · Mar 29, 2023 · Updated 2 years ago
- Lipschitz-constrained Unsupervised Skill Discovery (ICLR 2022) ☆38 · Jun 3, 2023 · Updated 2 years ago
- ☆99 · Mar 24, 2023 · Updated 2 years ago
- Implicit Normalizing Flows + Reinforcement Learning ☆62 · May 31, 2019 · Updated 6 years ago
- PyTorch implementation of SimPLe (Simulated Policy Learning) on the Atari 100k benchmark. ☆17 · Dec 7, 2022 · Updated 3 years ago
- The continuous mountain car problem solved with DDPG ☆13 · Apr 19, 2020 · Updated 5 years ago