facebookresearch / reward-estimator-corl

Reward Estimation for Variance Reduction in Deep Reinforcement Learning

☆22

Alternatives and similar repositories for reward-estimator-corl

Users who are interested in reward-estimator-corl are comparing it to the repositories listed below.

facebookresearch / ddr
Decoupling Dynamics and Reward for Transfer Learning
☆16 Updated 6 years ago
google-research / policy-learning-landscape
Explore the optimization landscape for direct policy learning reinforcement learning.
☆51 Updated 6 years ago
facebookresearch / measuring-emergent-comm
On the pitfalls of measuring emergent communication
☆34 Updated 6 years ago
rddy / isql
Inferring beliefs about dynamics from behavior
☆29 Updated 7 years ago
TianhongDai / self-imitation-learning-pytorch
This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.
☆66 Updated 6 years ago
Feryal / craft-env
☆44 Updated 6 years ago
flowersteam / Unsupervised_Goal_Space_Learning
Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"
☆21 Updated 7 years ago
flowersteam / geppg
☆35 Updated 6 years ago
facebookresearch / slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆93 Updated 5 years ago
idlrl / flare
RL framework for embodied agents based on PyTorch
☆11 Updated 6 years ago
facebookresearch / modeling_long_term_future
Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future
☆50 Updated 6 years ago
facebookresearch / td-delta
Separating value functions across time-scales.
☆17 Updated 6 years ago
shagunsodhani / memory-augmented-self-play
PyTorch implementation of Memory Augmented Self-Play
☆52 Updated 4 years ago
alexis-jacq / LOLA_DiCE
Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
☆96 Updated 6 years ago
ethanluoyc / e2c-pytorch
E2C implementation in PyTorch
☆43 Updated 8 years ago
tmoer / multimodal_varinf
Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".
☆35 Updated 7 years ago
willwhitney / dynamics-aware-embeddings
Official implementation of DynE, Dynamics-aware Embeddings for RL
☆43 Updated 4 years ago
david-abel / rl_abstraction
Code for experimenting with state and action abstractions in reinforcement learning.
☆30 Updated 4 years ago
nosyndicate / pytorchrl
Deep Reinforcement Learning algorithms implemented in PyTorch
☆49 Updated 7 years ago
facebookresearch / M3RL
Mind-aware Multi-agent Management Reinforcement Learning
☆82 Updated 6 years ago
voot-t / guide-actor-critic
Keras implementation of guide actor-critic for continuous control
☆10 Updated 7 years ago
lmb-freiburg / td-or-not-td
Code for the paper "TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning", Artemij Amiranashvili, Ale…
☆12 Updated 6 years ago
EndingCredits / Neural-Episodic-Control
Implementation of Deepmind's Neural Episodic Control
☆58 Updated 7 years ago
facebookresearch / adversarially-motivated-intrinsic-goals
This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".
☆63 Updated last year
zafarali / emdp
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
☆49 Updated 3 years ago
Shallow-Updates-for-Deep-RL / Shallow_Updates_for_Deep_RL
Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"
☆18 Updated 7 years ago
AdeelMufti / RL-RND
Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation
☆31 Updated 6 years ago
quanvuong / Supervised_Policy_Update
Code to reproduce Supervised Policy Update (ICLR 2019)
☆17 Updated 2 years ago
mcgillmrl / prob_mbrl
A library of probabilistic model based RL algorithms in pytorch
☆107 Updated 4 years ago
TomZahavy / GrayingTheBox
Code implementation of: "Graying the black box: Understanding DQNs"
☆20 Updated 8 years ago