seungjaeryanlee / playing-hard-exploration-games-by-watching-youtube

[WIP] Playing Hard Exploration Games by Watching YouTube (Aytar et al., 2018)

☆12

Related projects ⓘ

Alternatives and complementary repositories for playing-hard-exploration-games-by-watching-youtube

zuoxingdong / dm2gym
Convert DeepMind Control Suite to OpenAI gym environments.
☆83Updated 4 years ago
rraileanu / auto-drac
Automatic Data-Regularized Actor-Critic (Auto-DrAC)
☆102Updated last year
RLAgent / state-marginal-matching
Efficient Exploration via State Marginal Matching (2019)
☆66Updated 5 years ago
yusukeurakami / plan2explore-pytorch
☆41Updated 3 years ago
WilsonWangTHU / neural_graph_evolution
☆45Updated last year
stanford-iprl-lab / GRAC
implementation of our self-guided and self-regularized actor-critic algorithm
☆30Updated last year
ruizhaogit / EnergyBasedPrioritization
Energy-Based Hindsight Experience Prioritization (CoRL 2018) Oral presentation (7%)
☆33Updated 5 years ago
AIcrowd / neurips2020-procgen-starter-kit
Starter Kit for NeurIPS 2020 - Procgen Competition on AIcrowd
☆90Updated last year
ashedwards / ILPO
Official implementation of ICML paper Imitating Latent Policies from Observation
☆73Updated 5 years ago
sparisi / cbet
Change-Based Exploration Transfer
☆37Updated 2 years ago
AIcrowd / real_robots
Gym environments for Robots that learn to interact with the environment autonomously
☆34Updated last year
dhruvramani / model-based-atari
An easy to understand implementation of the paper "Model-Based Reinforcement Learning for Atari"
☆15Updated 5 years ago
mengf1 / DHER
DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)
☆66Updated 5 years ago
pathak22 / exploration-by-disagreement
[ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement
☆123Updated 5 years ago
ruizhaogit / mep
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)
☆23Updated 5 years ago
ermongroup / InfoGAIL
Source code for our NIPS 2017 paper, InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations
☆42Updated 7 years ago
ppocma / ppocma
☆71Updated 5 years ago
snu-mllab / EMI
Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.
☆36Updated 3 years ago
tshrjn / env-zoo
A curated list of reinforcement learning environments and frameworks.
☆50Updated 5 years ago
xuanlinli17 / iclr2021_rlreg
Regularization Matters in Policy Optimization
☆20Updated 3 years ago
neka-nat / distributed_rl
Pytorch implementation of distributed deep reinforcement learning
☆74Updated 2 years ago
paulorauber / hpg
Hindsight policy gradients
☆43Updated 4 years ago
victorcampos7 / edl
Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills"
☆36Updated 4 years ago
nicklashansen / policy-adaptation-during-deployment
Training code and evaluation benchmarks for the "Self-Supervised Policy Adaptation during Deployment" paper.
☆112Updated 4 years ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32Updated 4 years ago
nnaisense / MAX
Code for reproducing experiments in Model-Based Active Exploration, ICML 2019
☆78Updated 5 years ago
tgangwani / RL-Indirect-imitation
Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)
☆19Updated 4 years ago
MaxSobolMark / HardRLWithYoutube
TensorFlow implementation of "Playing hard exploration games by watching YouTube"
☆37Updated 5 years ago
catalyst-team / catalyst-rl
☆46Updated 3 years ago
jhejna / hierarchical_morphology_transfer
Code for paper "Hierarchically Decoupled Imitation for Morphological Transfer"
☆17Updated last year