Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
☆1,405Nov 29, 2023Updated 2 years ago
Alternatives and similar repositories for softlearning
Users that are interested in softlearning are comparing it to the libraries listed below
Sorting:
- Soft Actor-Critic☆1,212Nov 29, 2023Updated 2 years ago
- Collection of reinforcement learning algorithms☆2,867Jun 17, 2024Updated last year
- Reinforcement Learning with Deep Energy-Based Policies☆436Nov 28, 2023Updated 2 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆531Nov 22, 2022Updated 3 years ago
- Author's PyTorch implementation of TD3 for OpenAI gym tasks☆2,033Jul 14, 2023Updated 2 years ago
- PyTorch implementation of soft actor critic☆936Jul 17, 2025Updated 7 months ago
- A collection of reference environments for offline reinforcement learning☆1,649Nov 18, 2024Updated last year
- Reinforcement Learning in PyTorch☆2,275Jan 4, 2021Updated 5 years ago
- A toolkit for reproducible reinforcement learning research.☆2,082May 4, 2023Updated 2 years ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…☆3,875May 29, 2022Updated 3 years ago
- Author's PyTorch implementation of BCQ for continuous and discrete actions☆656Apr 6, 2021Updated 4 years ago
- A fork of OpenAI Baselines, implementations of reinforcement learning algorithms☆4,323Sep 4, 2022Updated 3 years ago
- rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.☆3,044Jun 10, 2023Updated 2 years ago
- Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆473Jul 6, 2023Updated 2 years ago
- [RSS 2019] End-to-End Robotic Reinforcement Learning without Reward Engineering☆375Nov 22, 2022Updated 3 years ago
- Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)☆506Dec 1, 2022Updated 3 years ago
- CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning☆599Oct 28, 2020Updated 5 years ago
- Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning☆1,747Jan 20, 2026Updated last month
- PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Lear…☆1,272Feb 9, 2021Updated 5 years ago
- Learning Latent Dynamics for Planning from Pixels☆1,234Mar 24, 2023Updated 2 years ago
- A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.☆1,198Oct 17, 2022Updated 3 years ago
- PyTorch implementation of Soft Actor-Critic (SAC)☆588Dec 5, 2021Updated 4 years ago
- OpenAI Baselines: high-quality implementations of reinforcement learning algorithms☆16,650Aug 1, 2024Updated last year
- S-RL Toolbox: Reinforcement Learning (RL) and State Representation Learning (SRL) for Robotics☆655Apr 5, 2021Updated 4 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)☆253May 3, 2020Updated 5 years ago
- Guided Policy Search☆604Feb 9, 2021Updated 5 years ago
- Modularized Implementation of Deep RL Algorithms in PyTorch☆3,407Apr 16, 2024Updated last year
- An educational resource to help anyone learn deep reinforcement learning.☆11,607Aug 5, 2024Updated last year
- [ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning☆1,470Dec 7, 2022Updated 3 years ago
- PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT…☆1,335Mar 13, 2025Updated 11 months ago
- bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent☆1,529Apr 13, 2024Updated last year
- Dream to Control: Learning Behaviors by Latent Imagination☆581Sep 10, 2021Updated 4 years ago
- Code for conservative Q-learning☆472Dec 7, 2021Updated 4 years ago
- Code for the paper "Exploration by Random Network Distillation"☆930Oct 1, 2020Updated 5 years ago
- Implementation of TRPO and related algorithms☆647May 20, 2018Updated 7 years ago
- Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.☆4,457Feb 13, 2026Updated 2 weeks ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model☆154Oct 26, 2020Updated 5 years ago
- Multitask Environments for RL☆281Aug 23, 2021Updated 4 years ago
- PyTorch implementation of SAC-Discrete.☆314Jul 25, 2024Updated last year