microsoft / oac-explore

Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)

☆69

Alternatives and similar repositories for oac-explore

Users who are interested in oac-explore are comparing it to the libraries listed below.

pokaxpoka / sunrise
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
☆125 Updated 4 years ago
facebookresearch / level-replay
This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …
☆88 Updated 4 years ago
ezliu / dream
Decoupled Reward-free ExplorAtion and Execution for Meta-reinforcement learning
☆90 Updated 2 years ago
Hwhitetooth / lirpg
☆61 Updated 7 years ago
aviralkumar2907 / BEAR
Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
☆161 Updated 5 years ago
ben-eysenbach / sac
Soft Actor-Critic
☆151 Updated 7 years ago
russellmendonca / maesn_suite
☆43 Updated 6 years ago
lmzintgraf / varibad
Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)
☆192 Updated 2 years ago
rraileanu / auto-drac
Automatic Data-Regularized Actor-Critic (Auto-DrAC)
☆102 Updated 2 years ago
hiwonjoon / ICML2019-TREX
☆84 Updated 4 years ago
thanard / me-trpo
☆92 Updated last year
jcwleo / curiosity-driven-exploration-pytorch
Curiosity-driven Exploration by Self-supervised Prediction
☆139 Updated 2 years ago
johannah / bootstrap_dqn
Implementation of Bootstrap DQN and Randomized Prior Functions on ALE
☆54 Updated 4 months ago
spitis / mrl
☆113 Updated 2 years ago
facebookresearch / deep_bisim4control
Learning Invariant Representations for Reinforcement Learning without Reconstruction
☆149 Updated 3 years ago
RomainLaroche / SPIBB
Safe Policy Improvement with Baseline Bootstrapping
☆26 Updated 5 years ago
louiskirsch / metagenrl
MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…
☆67 Updated 5 years ago
dnddnjs / feudal-montezuma
PyTorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge
☆96 Updated 3 years ago
quanvuong / handful-of-trials-pytorch
Unofficial PyTorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
☆190 Updated 2 years ago
implementation-matters / code-for-paper
☆111 Updated 5 years ago
mila-iqia / spr
Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"
☆161 Updated 3 years ago
WilsonWangTHU / POPLIN
☆99 Updated 2 years ago
qlan3 / Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
☆95 Updated last month
toshikwa / slac.pytorch
PyTorch implementation of Stochastic Latent Actor-Critic (SLAC).
☆93 Updated last year
nathangrinsztajn / Box-World
Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning"
☆46 Updated last year
geyang / e-maml
E-MAML and an RL-MAML baseline, implemented in TensorFlow v1
☆16 Updated 5 years ago
alexlee-gk / slac
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
☆151 Updated 4 years ago
jonasrothfuss / ProMP
Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithms. Includes a useful experiment framework for Me…
☆238 Updated 2 years ago
tianheyu927 / mopo
Code for MOPO: Model-based Offline Policy Optimization
☆182 Updated 3 years ago
Farama-Foundation / D4RL-Evaluations
☆199 Updated 2 years ago