DBaudry / Sub-Sampling-Dueling-Algorithms-Neurips20Links

☆9

Alternatives and similar repositories for Sub-Sampling-Dueling-Algorithms-Neurips20

Users that are interested in Sub-Sampling-Dueling-Algorithms-Neurips20 are comparing it to the libraries listed below

Sorting:

abbyvansoest / maxent
☆14Updated 6 years ago
DBaudry / Information_Directed_Sampling
Implementation of Russo and Van Roy work on Information Directed Sampling (2017)
☆21Updated 6 years ago
ermongroup / CalibratedModelBasedRL
Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.
☆55Updated 6 years ago
jonasrothfuss / model_ensemble_meta_learning
Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm
☆44Updated 6 years ago
joeybose / FloRL
Implicit Normalizing Flows + Reinforcement Learning
☆61Updated 6 years ago
RomainLaroche / SPIBB
Safe Policy Improvement with Baseline Bootstrapping
☆26Updated 5 years ago
montrealrobotics / iv_rl
IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
☆40Updated 7 months ago
zafarali / emdp
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
☆49Updated 3 years ago
albertometelli / wql
☆9Updated 5 years ago
sebascuri / hucrl
☆30Updated last year
younggyoseo / RE3
RE3: State Entropy Maximization with Random Encoders for Efficient Exploration
☆68Updated 3 years ago
clvoloshin / COBS
OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.
☆61Updated 2 years ago
wendelinboehmer / dcg
☆76Updated last year
tessavdheiden / social_empowerment
☆17Updated 10 months ago
nathangrinsztajn / Box-World
Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning"
☆46Updated last year
rlai-lab / Regularized-GradientTD
Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.
☆38Updated 4 years ago
jparkerholder / DvD_ES
Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…
☆44Updated 4 years ago
facebookresearch / slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆93Updated 5 years ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32Updated 5 years ago
iosband / TabulaRL
☆65Updated last year
yfletberliac / adversarially-guided-actor-critic
AGAC: Adversarially Guided Actor-Critic
☆49Updated 3 years ago
mcmachado / options
☆43Updated 8 years ago
Feryal / craft-env
☆44Updated 6 years ago
victorcampos7 / edl
Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills"
☆37Updated 5 years ago
lcalem / reproduction-soft-qlearning-mutual-information
Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.
☆10Updated 6 years ago
koulanurag / mmn
Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks
☆50Updated 2 years ago
google-research / dice_rl
☆104Updated 10 months ago
google-research / deep_ope
☆86Updated 10 months ago
facebookresearch / icp-block-mdp
Invariant Causal Prediction for Block MDPs
☆44Updated 4 years ago
DartML / PPO-Stein-Control-Variate
Proximal Policy Optimization with Stein Control Variates:
☆33Updated 7 years ago