LihaoR / Entropy-Regularized-RL

Soft Q-learning and soft actor-critic

☆15

Alternatives and similar repositories for Entropy-Regularized-RL

Users who are interested in Entropy-Regularized-RL are comparing it to the libraries listed below.

jianing-sun / Interpolated-Policy-Gradient-with-PPO-for-Robotics-Control-
Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr…
☆37 Updated 6 years ago
fshamshirdar / pytorch-rdpg
PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)
☆55 Updated 2 years ago
tesslerc / GAC
Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"
☆22 Updated 5 years ago
hejia-zhang / awesome-model-based-reinforcement-learning
A curated list of awesome Model-based reinforcement learning resources
☆94 Updated 4 years ago
alirezakazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆51 Updated 2 months ago
Steven-Ho / VALOR
Implementation of VALOR (Variational Option Discovery Algorithms)
☆10 Updated 6 years ago
yashbonde / Transformer-RL
Experiments in training a transformer network to master reinforcement learning environments.
☆32 Updated 4 years ago
kpaonaut / HAAR-A-Hierarchical-RL-Algorithm
Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards
☆32 Updated 2 years ago
tesslerc / ActionRobustRL
Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918…
☆46 Updated 6 years ago
thanard / me-trpo
☆92 Updated last year
zhihanyang2022 / off-policy-continuous-control
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
☆88 Updated last year
liuzuxin / safe-mbrl
Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method
☆66 Updated 2 years ago
alversafa / option-critic-arch
Implementation of the Option-Critic Architecture
☆40 Updated 6 years ago
TianhongDai / distributed-ppo
This is a PyTorch implementation of Distributed Proximal Policy Optimization (DPPO).
☆62 Updated 7 years ago
karush17 / esac
Evolution-based Soft Actor-Critic (ESAC)
☆42 Updated last year
BlueFisher / Advanced-Soft-Actor-Critic
Soft Actor-Critic with advanced features
☆50 Updated this week
navneet-nmk / Hierarchical-Meta-Reinforcement-Learning
This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.
☆62 Updated 6 years ago
jparkerholder / DvD_ES
Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…
☆44 Updated 4 years ago
TonghanWang / DOP
Code accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)
☆51 Updated 2 years ago
apexrl / bmpo
Implementation of the ICML 2020 paper "Bidirectional Model-based Policy Optimization"
☆23 Updated 2 years ago
jesbu1 / hidio
GitHub repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options
☆46 Updated 3 years ago
eugenevinitsky / robust_RL_multi_adversary
We investigate the effect of populations on finding good solutions to the robust MDP
☆28 Updated 4 years ago
danielwillemsen / MAMBPO
DecentralizedLearning
☆24 Updated 2 years ago
YangRui2015 / Sparse-Reward-Algorithms
Implementations of many sparse-reward algorithms in the Gym Fetch environments
☆88 Updated 5 years ago
hu-po / pySACQ
PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)
☆37 Updated 4 years ago
feidieufo / homework
Assignments for CS294-112.
☆30 Updated 5 years ago
Knoxantropicen / model-based-meta-rl
Self-implemented code for Model-Based Meta-Reinforcement Learning
☆17 Updated 6 years ago
seolhokim / DistributedRL-Pytorch-Ray
Distributed RL implementation using PyTorch and Ray (Ape-X, A3C, Distributed PPO (DPPO), IMPALA)
☆27 Updated 3 years ago
IouJenLiu / CMAE
☆49 Updated 4 years ago
schatty / oprl
A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing
☆134 Updated 3 weeks ago