johannah / bootstrap_dqnLinks

Implementation of Bootstrap DQN and Randomized Prior Functions on ALE

☆54

Alternatives and similar repositories for bootstrap_dqn

Users that are interested in bootstrap_dqn are comparing it to the libraries listed below

Sorting:

dnddnjs / feudal-montezuma
Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge
☆96Updated 3 years ago
junjungoal / IMPALA-pytorch
PyTorch IMPALA implementation
☆27Updated 5 years ago
apourchot / CEM-RL
Combining Evolutionary Algorithms and deep RL in various ways
☆103Updated 4 years ago
microsoft / oac-explore
Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)
☆69Updated last year
pokaxpoka / sunrise
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
☆125Updated 4 years ago
facebookresearch / level-replay
This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …
☆88Updated 4 years ago
ermongroup / MetaIRL
Meta-Inverse Reinforcement Learning with Probabilistic Context Variables
☆73Updated 2 years ago
jcwleo / curiosity-driven-exploration-pytorch
Curiosity-driven Exploration by Self-supervised Prediction
☆139Updated 2 years ago
robintyh1 / onpolicybaselines
on-policy optimization baselines for deep reinforcement learning
☆30Updated 5 years ago
WilsonWangTHU / POPLIN
☆99Updated 2 years ago
quanvuong / handful-of-trials-pytorch
Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
☆190Updated 2 years ago
alversafa / option-critic-arch
Implementation of the Option-Critic Architecture
☆40Updated 6 years ago
maximilianigl / DVRL
Deep Variational Reinforcement Learning
☆136Updated 3 years ago
yusukeurakami / dreamer-pytorch
pytorch-implementation of Dreamer (Model-based Image RL Algorithm)
☆166Updated 6 months ago
mengf1 / CHER
Curriculum-guided Hindsight Experience Replay (NeurIPS-2019)
☆65Updated 5 years ago
thanard / me-trpo
☆92Updated last year
aviralkumar2907 / BEAR
Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
☆161Updated 5 years ago
SwapnilPande / MOReL
Model-Based Offline Reinforcement Learning
☆51Updated 4 years ago
toshikwa / slac.pytorch
PyTorch implementation of Stochastic Latent Actor-Critic(SLAC).
☆93Updated last year
tianheyu927 / mopo
Code for MOPO: Model-based Offline Policy Optimization
☆182Updated 3 years ago
Farama-Foundation / D4RL-Evaluations
☆199Updated 2 years ago
spitis / mrl
☆113Updated 2 years ago
facebookresearch / deep_bisim4control
Learning Invariant Representations for Reinforcement Learning without Reconstruction
☆149Updated 3 years ago
eugenevinitsky / robust_RL_multi_adversary
We investigate the effect of populations on finding good solutions to the robust MDP
☆28Updated 4 years ago
lmzintgraf / varibad
Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)
☆192Updated 2 years ago
hiwonjoon / ICML2019-TREX
☆84Updated 4 years ago
xkianteb / dril
Disagreement-Regularized Imitation Learning
☆30Updated 4 years ago
tdavchev / option-critic
A Tensorflow implementation of the Option-Critic Architecture
☆71Updated 8 years ago
RomainLaroche / SPIBB
Safe Policy Improvement with Baseline Bootstrapping
☆26Updated 5 years ago
llan-ml / tesp
Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"
☆34Updated 6 years ago