YyzHarry / SV-RLLinks

[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning

☆34

Alternatives and similar repositories for SV-RL

Users that are interested in SV-RL are comparing it to the libraries listed below

Sorting:

uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32Updated 5 years ago
snu-mllab / EMI
Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.
☆36Updated 4 years ago
tianjunz / MADE
☆19Updated 3 years ago
seungjaeryanlee / rl-exploration
Reinforcement Learning papers on exploration methods.
☆19Updated 4 years ago
orybkin / video-gcp
Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"
☆44Updated 2 years ago
Kaixhin / GUDRL
Generalised UDRL
☆37Updated 3 years ago
joelouismarino / variational_rl
Variational Reinforcement Learning
☆16Updated 11 months ago
willwhitney / dynamics-aware-embeddings
Official implementation of DynE, Dynamics-aware Embeddings for RL
☆43Updated 4 years ago
ermongroup / CalibratedModelBasedRL
Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.
☆56Updated 6 years ago
kaixin96 / mixreg
Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization
☆33Updated 4 years ago
haoliuhl / taming-maml
Taming MAML: efficient unbiased meta-reinforcement learning
☆29Updated 2 years ago
LinZichuan / AdMRL
Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)
☆35Updated 4 years ago
StanfordVL / ac-teach
Code for the CoRL 2019 paper AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers
☆24Updated 2 years ago
jannerm / gamma-models
Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"
☆44Updated last year
chandar-lab / Lifelong-Hanabi
A Continual Multi-agent RL testbed based on Hanabi
☆30Updated 3 years ago
ruizhaogit / mep
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)
☆24Updated 6 years ago
quanvuong / Supervised_Policy_Update
Code to reproduce Supervised Policy Update (ICLR 2019)
☆17Updated 2 years ago
taodav / nsrs
Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.
☆14Updated last year
google-research / pisac
Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)
☆44Updated 2 years ago
illidanlab / rpg
Ranking Policy Gradient
☆23Updated 5 years ago
AutumnWu / Streamlined-Off-Policy-Learning
ICRL 2020
☆19Updated 5 years ago
roosephu / slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆55Updated 5 years ago
behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24Updated 5 years ago
RuohanW / RED
Implementation of Random Expert Distillation
☆29Updated 6 years ago
google-deepmind / active_ops
☆32Updated 11 months ago
rcheng805 / CORE-RL
Code implementing the CORE-RL algorithm with DDPG, PPO, and TRPO. See the paper "Control Regularization for Reduced Variance Reinforcemen…
☆32Updated 4 years ago
clvrai / new-actions-rl
☆24Updated 11 months ago
psclklnk / spdl
Source code for the Self-Paced Deep Reinforcement Learning Experiments
☆32Updated 2 years ago
zzyunzhi / vds
Code for Automatic Curriculum Learning through Value Disagreement
☆30Updated 5 years ago
sfujim / SR-DICE
Author's PyTorch implementation of SR-DICE for marginalized importance sampling
☆17Updated 3 years ago