MiniHackPlanet / MiniHack

☆10

Related projects:

flowersteam / rl-difference-testing
Simple tools for statistical analyses in RL experiments
☆66 Updated 6 years ago
kkhetarpal / ioc
Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020
☆25 Updated 4 years ago
iosband / TabulaRL
☆65 Updated 6 months ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆30 Updated 4 years ago
mcmachado / options
☆42 Updated 7 years ago
zafarali / emdp
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
☆47 Updated 2 years ago
philipjball / OffCon3
📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)
☆24 Updated 3 years ago
facebookresearch / adversarially-motivated-intrinsic-goals
This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".
☆61 Updated last year
chloechsu / revisiting-ppo
☆47 Updated 3 years ago
DartML / PPO-Stein-Control-Variate
Proximal Policy Optimization with Stein Control Variates
☆33 Updated 6 years ago
facebookresearch / level-replay
This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …
☆83 Updated 3 years ago
evgenii-nikishin / omd
JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"
☆43 Updated 3 years ago
jannerm / gamma-models
Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"
☆39 Updated last year
nnaisense / MAX
Code for reproducing experiments in Model-Based Active Exploration, ICML 2019
☆77 Updated 5 years ago
flowersteam / geppg
☆35 Updated 6 years ago
JohanSamir / revisiting_rainbow
Revisiting Rainbow
☆73 Updated 3 years ago
nosyndicate / pytorchrl
Deep Reinforcement Learning algorithms implemented in PyTorch
☆49 Updated 6 years ago
bstadie / krazyworld
krazy grid world
☆25 Updated 4 years ago
tmoer / multimodal_varinf
Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".
☆33 Updated 6 years ago
YuhangSong / Arena-Baselines-Depreciated
☆35 Updated this week
lerrytang / train-procgen-pfrl
PyTorch code to train and evaluate Procgen tasks
☆23 Updated 3 years ago
RonanFR / UCRL
☆25 Updated 5 years ago
alexis-jacq / LOLA_DiCE
PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
☆90 Updated 6 years ago
seungjaeryanlee / rl-exploration
Reinforcement Learning papers on exploration methods.
☆20 Updated 3 years ago
Feryal / craft-env
☆44 Updated 5 years ago
jeanharb / a2oc_delib
A3C-style Option-Critic with deliberation cost
☆39 Updated 6 years ago
tesatory / hsp
Hierarchical Self-Play
☆21 Updated 5 years ago
brain-research / mirage-rl
Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.
☆17 Updated 6 years ago
Breakend / OptionGAN
Code accompanying the OptionGAN paper.
☆43 Updated 6 years ago
david-abel / rl_abstraction
Code for experimenting with state and action abstractions in reinforcement learning.
☆30 Updated 3 years ago