facebookresearch / level-replay

This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the fact that not all levels are equally useful for agents to learn from during training.

☆88

Alternatives and similar repositories for level-replay

Users who are interested in level-replay are comparing it to the libraries listed below.

evgenii-nikishin / rl_with_resets
JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"
☆100Updated 3 years ago
rraileanu / idaac
☆54Updated last year
yifan12wu / rl-laplacian
Learning Laplacian Representations in Reinforcement Learning
☆17Updated 4 years ago
ahmed-touati / controllable_agent
☆46Updated 2 years ago
johanobandoc / revisiting_rainbow
Revisiting Rainbow
☆75Updated 4 years ago
facebookresearch / icp-block-mdp
Invariant Causal Prediction for Block MDPs
☆44Updated 5 years ago
toshikwa / rljax
A collection of RL algorithms written in JAX.
☆101Updated 3 years ago
google-research / deep_ope
☆86Updated 11 months ago
mila-iqia / spr
Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"
☆161Updated 3 years ago
RajGhugare19 / alm
Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective
☆81Updated 2 years ago
tianheyu927 / mopo
Code for MOPO: Model-based Offline Policy Optimization
☆181Updated 3 years ago
facebookresearch / impact-driven-exploration
impact-driven-exploration
☆131Updated last year
spitis / mrl
☆113Updated 2 years ago
microsoft / oac-explore
Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)
☆69Updated last year
denisyarats / proto
Proto-RL: Reinforcement Learning with Prototypical Representations
☆82Updated 3 years ago
younggyoseo / RE3
RE3: State Entropy Maximization with Random Encoders for Efficient Exploration
☆69Updated 4 years ago
kzl / lifelong_rl
PyTorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Re…
☆107Updated 3 years ago
ezliu / dream
Decoupled Reward-free ExplorAtion and Execution for Meta-reinforcement learning
☆90Updated 2 years ago
younggyoseo / CaDM
CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning
☆63Updated 5 years ago
yusukeurakami / dreamer-pytorch
PyTorch implementation of Dreamer (model-based image RL algorithm)
☆166Updated 6 months ago
denisyarats / exorl
ExORL: Exploratory Data for Offline Reinforcement Learning
☆115Updated 3 years ago
rraileanu / auto-drac
Automatic Data-Regularized Actor-Critic (Auto-DrAC)
☆102Updated 2 years ago
geyang / e-maml
E-MAML and RL-MAML baseline implemented in TensorFlow v1
☆16Updated 5 years ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32Updated 5 years ago
RLAgent / state-marginal-matching
Efficient Exploration via State Marginal Matching (2019)
☆69Updated 6 years ago
lmzintgraf / varibad
Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)
☆192Updated 2 years ago
pokaxpoka / sunrise
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
☆125Updated 4 years ago
tedmoskovitz / TOP
Implementation of Tactical Optimistic and Pessimistic value estimation
☆25Updated 2 years ago
deep-skill-chaining / deep-skill-chaining
Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"
☆29Updated 5 years ago
toshikwa / slac.pytorch
PyTorch implementation of Stochastic Latent Actor-Critic (SLAC).
☆93Updated last year