Reinforcement Learning with Deep Energy-Based Policies
☆436Nov 28, 2023Updated 2 years ago
Alternatives and similar repositories for softqlearning
Users that are interested in softqlearning are comparing it to the libraries listed below
Sorting:
- Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp…☆1,416Nov 29, 2023Updated 2 years ago
- Soft Actor-Critic☆1,230Nov 29, 2023Updated 2 years ago
- rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.☆3,050Jun 10, 2023Updated 2 years ago
- ICML 2018 Self-Imitation Learning☆277Apr 18, 2020Updated 5 years ago
- ☆160Jul 21, 2017Updated 8 years ago
- Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"☆349Nov 22, 2018Updated 7 years ago
- Collection of reinforcement learning algorithms☆2,877Jun 17, 2024Updated last year
- Guided Policy Search☆605Feb 9, 2021Updated 5 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- NIPS 2017 Value Prediction Network☆167Jan 12, 2018Updated 8 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆155Sep 22, 2017Updated 8 years ago
- Noisy Networks for Exploration☆187Jan 28, 2018Updated 8 years ago
- Implementation of TRPO and related algorithms☆648May 20, 2018Updated 7 years ago
- Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"☆310Apr 13, 2023Updated 2 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆133May 5, 2019Updated 6 years ago
- Inferring beliefs about dynamics from behavior☆30May 24, 2018Updated 7 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction☆163Jul 17, 2020Updated 5 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆268Oct 24, 2019Updated 6 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- ☆345Jan 24, 2018Updated 8 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym☆362Jun 2, 2020Updated 5 years ago
- ☆276Jun 5, 2018Updated 7 years ago
- Code for the paper "Meta-Learning Shared Hierarchies"☆618Jul 6, 2023Updated 2 years ago
- Multitask Environments for RL☆282Aug 23, 2021Updated 4 years ago
- [ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning☆1,473Dec 7, 2022Updated 3 years ago
- PyTorch implementation of Trust Region Policy Optimization☆450Sep 13, 2018Updated 7 years ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…☆3,882May 29, 2022Updated 3 years ago
- Code for hierarchical imitation learning and reinforcement learning☆301Mar 14, 2018Updated 8 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆537Nov 22, 2022Updated 3 years ago
- Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆472Jul 6, 2023Updated 2 years ago
- Code for the paper "Generative Adversarial Imitation Learning"☆731Nov 22, 2018Updated 7 years ago
- Value Iteration Networks☆291Apr 21, 2017Updated 8 years ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆375Oct 15, 2021Updated 4 years ago
- DEPRECATED: Open-source software for robot simulation, integrated with OpenAI Gym.☆2,169Apr 2, 2023Updated 2 years ago
- Code for the paper "Emergent Complexity via Multi-agent Competition"☆833Apr 2, 2023Updated 2 years ago
- An implementation of the Augmented Random Search algorithm☆428Sep 29, 2021Updated 4 years ago
- These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implem…☆17Sep 20, 2017Updated 8 years ago
- Author's PyTorch implementation of BCQ for continuous and discrete actions☆660Apr 6, 2021Updated 4 years ago
- Reinforcement learning with unsupervised auxiliary tasks☆423Feb 13, 2019Updated 7 years ago