aditimavalankar / option-keyboardLinks

PyTorch implementation of "The Option Keyboard: Combining Skills in Reinforcement Learning" (NeurIPS 2019)

☆12

Alternatives and similar repositories for option-keyboard

Users that are interested in option-keyboard are comparing it to the libraries listed below

Sorting:

facebookresearch / impact-driven-exploration
impact-driven-exploration
☆131Updated last year
mcmachado / count_based_exploration_sr
☆31Updated 6 years ago
kenjyoung / MinAtar
☆305Updated 7 months ago
facebookresearch / level-replay
This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …
☆87Updated 4 years ago
spitis / mrl
☆113Updated 2 years ago
denisyarats / dmc2gym
OpenAI Gym wrapper for the DeepMind Control Suite
☆217Updated last year
kkhetarpal / ioc
Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020
☆25Updated 4 years ago
joonleesky / train-procgen-pytorch
Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.
☆30Updated 4 years ago
tianheyu927 / mopo
Code for MOPO: Model-based Offline Policy Optimization
☆180Updated 3 years ago
Farama-Foundation / D4RL-Evaluations
☆199Updated 2 years ago
johanobandoc / revisiting_rainbow
Revisiting Rainbow
☆75Updated 4 years ago
RomainLaroche / SPIBB
Safe Policy Improvement with Baseline Bootstrapping
☆26Updated 5 years ago
rraileanu / idaac
☆54Updated last year
hiwonjoon / ICML2019-TREX
☆84Updated 4 years ago
ben-eysenbach / sac
Soft Actor-Critic
☆151Updated 7 years ago
quanvuong / handful-of-trials-pytorch
Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
☆190Updated 2 years ago
mklissa / PPOC
Proximal Policy Option-Critic
☆25Updated 6 years ago
facebookresearch / deep_bisim4control
Learning Invariant Representations for Reinforcement Learning without Reconstruction
☆149Updated 3 years ago
qlan3 / Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
☆94Updated last month
mila-iqia / spr
Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"
☆161Updated 3 years ago
clvoloshin / COBS
OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.
☆61Updated 2 years ago
paulorauber / hpg
Hindsight policy gradients
☆45Updated 5 years ago
RLAgent / state-marginal-matching
Efficient Exploration via State Marginal Matching (2019)
☆69Updated 6 years ago
yifan12wu / rl-laplacian
Learning Laplacian Representations in Reinforcement Learning
☆16Updated 4 years ago
jcwleo / curiosity-driven-exploration-pytorch
Curiosity-driven Exploration by Self-supervised Prediction
☆138Updated 2 years ago
microsoft / oac-explore
Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)
☆70Updated last year
jeanharb / a2oc_delib
A3C style Option-Critic with deliberation cost
☆39Updated 7 years ago
kngwyu / Rainy
Deep RL agents with PyTorch
☆35Updated 3 years ago
nathangrinsztajn / Box-World
Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning"
☆46Updated last year
Hwhitetooth / lirpg
☆61Updated 7 years ago