Reinforcement Learning with Deep Energy-Based Policies
☆438Nov 28, 2023Updated 2 years ago
Alternatives and similar repositories for softqlearning
Users that are interested in softqlearning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp…☆1,424Nov 29, 2023Updated 2 years ago
- Soft Actor-Critic☆1,270Nov 29, 2023Updated 2 years ago
- rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.☆3,064Jun 10, 2023Updated 3 years ago
- ICML 2018 Self-Imitation Learning☆276Apr 18, 2020Updated 6 years ago
- ☆161Jul 21, 2017Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"☆347Nov 22, 2018Updated 7 years ago
- Collection of reinforcement learning algorithms☆2,904Jun 17, 2024Updated last year
- Guided Policy Search☆599Feb 9, 2021Updated 5 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- NIPS 2017 Value Prediction Network☆166Jan 12, 2018Updated 8 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆154Sep 22, 2017Updated 8 years ago
- Noisy Networks for Exploration☆187Jan 28, 2018Updated 8 years ago
- Implementation of TRPO and related algorithms☆652May 20, 2018Updated 8 years ago
- Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"☆309Apr 13, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Inferring beliefs about dynamics from behavior☆30May 24, 2018Updated 8 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆133May 5, 2019Updated 7 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction☆164Jul 17, 2020Updated 5 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆268Oct 24, 2019Updated 6 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- ☆345Jan 24, 2018Updated 8 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym☆362Jun 2, 2020Updated 6 years ago
- ☆274Jun 5, 2018Updated 8 years ago
- Code for the paper "Meta-Learning Shared Hierarchies"☆619Jul 6, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Multitask Environments for RL☆282Aug 23, 2021Updated 4 years ago
- PyTorch implementation of Trust Region Policy Optimization☆448Sep 13, 2018Updated 7 years ago
- [ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning☆1,479Dec 7, 2022Updated 3 years ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…☆3,901May 29, 2022Updated 4 years ago
- Code for hierarchical imitation learning and reinforcement learning☆300Mar 14, 2018Updated 8 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆548Nov 22, 2022Updated 3 years ago
- Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆475Jul 6, 2023Updated 2 years ago
- Code for the paper "Generative Adversarial Imitation Learning"☆730Nov 22, 2018Updated 7 years ago
- Value Iteration Networks☆291Apr 21, 2017Updated 9 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆377Oct 15, 2021Updated 4 years ago
- DEPRECATED: Open-source software for robot simulation, integrated with OpenAI Gym.☆2,171Apr 2, 2023Updated 3 years ago
- Code for the paper "Emergent Complexity via Multi-agent Competition"☆835Apr 2, 2023Updated 3 years ago
- An implementation of the Augmented Random Search algorithm☆429Sep 29, 2021Updated 4 years ago
- These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implem…☆17Sep 20, 2017Updated 8 years ago
- Author's PyTorch implementation of BCQ for continuous and discrete actions☆663Apr 6, 2021Updated 5 years ago
- Reinforcement learning with unsupervised auxiliary tasks☆422Feb 13, 2019Updated 7 years ago