sparisi / td-reg

TD-Regularized Actor-Critic Methods

☆36

Alternatives and similar repositories for td-reg

Users who are interested in td-reg are comparing it to the libraries listed below.

ermongroup / CalibratedModelBasedRL
Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.
☆55 · Updated 6 years ago
ppocma / ppocma
☆72 · Updated 6 years ago
Santara / stochastic_value_gradient
Implementation of "Learning Continuous Control Policies by Stochastic Value Gradients" (https://arxiv.org/abs/1510.09142)
☆25 · Updated 3 years ago
mcgillmrl / prob_mbrl
A library of probabilistic model based RL algorithms in pytorch
☆107 · Updated 4 years ago
Riashat / Policy-Gradient-Reinforcement-Learning
☆37 · Updated 9 years ago
jonasrothfuss / model_ensemble_meta_learning
Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm
☆44 · Updated 6 years ago
flowersteam / geppg
☆35 · Updated 7 years ago
hari-sikchi / safeRL
Safe Reinforcement Learning algorithms
☆74 · Updated 2 years ago
HumanCompatibleAI / population-irl
(Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards
☆28 · Updated 6 years ago
WilsonWangTHU / POPLIN
☆99 · Updated 2 years ago
zuoxingdong / DeepPILCO
☆54 · Updated 7 years ago
pfnet-research / capg
Implementation of clipped action policy gradient (CAPG) with PPO and TRPO
☆31 · Updated 7 years ago
marcino239 / pilco
Using the PILCO algorithm to find a controller for a few robotic problems
☆43 · Updated 10 years ago
krasheninnikov / max-causal-ent-irl
Maximum Causal Entropy Inverse Reinforcement Learning
☆48 · Updated 6 years ago
behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24 · Updated 5 years ago
microsoft / logrl
Logarithmic Reinforcement Learning
☆26 · Updated 2 years ago
zafarali / emdp
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
☆49 · Updated 3 years ago
cap-ntu / baconian-project
Model-based Reinforcement Learning Framework
☆114 · Updated 5 years ago
dnddnjs / feudal-montezuma
Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge
☆96 · Updated 3 years ago
ruizhaogit / mep
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)
☆24 · Updated 6 years ago
mengf1 / DHER
DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)
☆66 · Updated 5 years ago
google-research / policy-learning-landscape
Explore the optimization landscape for direct policy learning reinforcement learning.
☆51 · Updated 6 years ago
tdavchev / option-critic
A Tensorflow implementation of the Option-Critic Architecture
☆71 · Updated 8 years ago
IouJenLiu / PIC
PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning
☆49 · Updated 4 years ago
xkianteb / dril
Disagreement-Regularized Imitation Learning
☆30 · Updated 4 years ago
tmoer / multimodal_varinf
Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".
☆35 · Updated 7 years ago
kindredresearch / arp
Autoregressive policies for continuous control reinforcement learning
☆32 · Updated 6 years ago
ajgupta93 / d3pg-pytorch
Distributed DDPG implementation in pytorch
☆9 · Updated 7 years ago
lerrel / gym-adv
Gym environments modified with adversarial agents
☆36 · Updated 8 years ago
aravindsrinivas / neural-mpc
☆73 · Updated 5 years ago