david-abel / rl_info_theoryLinks

A collection of code investigating the use of information theory for abstractions in RL

☆16

Alternatives and similar repositories for rl_info_theory

Users that are interested in rl_info_theory are comparing it to the libraries listed below

Sorting:

flowersteam / geppg
☆35Updated 6 years ago
flowersteam / rl-difference-testing
Simple tools for statistical analyses in RL experiments
☆67Updated 7 years ago
senya-ashukha / quantile-regression-dqn-pytorch
A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning
☆96Updated 4 years ago
google-research / policy-learning-landscape
Explore the optimization landscape for direct policy learning reinforcement learning.
☆51Updated 6 years ago
shagunsodhani / memory-augmented-self-play
PyTorch implementation of Memory Augmented Self-Play
☆52Updated 4 years ago
flowersteam / Unsupervised_Goal_Space_Learning
Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"
☆21Updated 7 years ago
alexis-jacq / LOLA_DiCE
Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
☆96Updated 6 years ago
Breakend / OptionGAN
Code accompanying the OptionGAN paper.
☆44Updated 6 years ago
facebookresearch / modeling_long_term_future
Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future
☆50Updated 6 years ago
jeappen / gym-grid
A simple Gridworld environment for Open AI gym
☆25Updated 7 years ago
david-abel / rl_abstraction
Code for experimenting with state and action abstractions in reinforcement learning.
☆30Updated 4 years ago
mfranzs / meta-learning-curiosity-algorithms
☆80Updated last year
TianhongDai / self-imitation-learning-pytorch
This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.
☆66Updated 6 years ago
facebookresearch / slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆93Updated 5 years ago
HumanCompatibleAI / rlsp
Reward Learning by Simulating the Past
☆44Updated 6 years ago
jachiam / surprise
Surprise-based intrinsic motivation for deep reinforcement learning
☆20Updated 8 years ago
ofirnachum / models
Models built with TensorFlow
☆25Updated 6 years ago
wassname / world-models-sonic-pytorch
Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w…
☆32Updated 6 years ago
AnujMahajanOxf / VIREL
Code for VIREL: A Variational Inference Framework for Reinforcement Learning
☆14Updated 5 years ago
evgenii-nikishin / omd
JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"
☆43Updated 4 years ago
supratikp / HOOF
Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583
☆19Updated 5 years ago
njustesen / a2c_gvgai
A2C for GVG-AI
☆22Updated 6 years ago
rddy / isql
Inferring beliefs about dynamics from behavior
☆29Updated 7 years ago
ethanluoyc / e2c-pytorch
E2C implementation in PyTorch
☆43Updated 8 years ago
ShibiHe / Q-Optimality-Tightening
This is my implementation of the Optimality Tightening
☆37Updated 8 years ago
R-McHenry / ParallelizedGoExplore
A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post
☆46Updated 6 years ago
ruizhaogit / mep
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)
☆24Updated 6 years ago
paintception / Deep-Quality-Value-DQV-Learning-
DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm
☆25Updated 2 years ago
instadeepai / AlphaNPI
Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.
☆79Updated last year
chenxy99 / Generative-Temporal-Models-with-Spatial-Memory
Reimplementation code for the paper "Generative Temporal Models with Spatial Memory for Partially Observed Environments"
☆29Updated 3 years ago