paintception / Deep-Quality-Value-Family

Official implementation of the paper "Approximating two value functions instead of one: towards characterizing a new family of Deep Reinforcement Learning Algorithms" (https://arxiv.org/abs/1909.01779). To appear at the NeurIPS 2019 DRL Workshop.

☆11

Alternatives and similar repositories for Deep-Quality-Value-Family

Users who are interested in Deep-Quality-Value-Family are comparing it to the libraries listed below.

paintception / Deep-Quality-Value-DQV-Learning-
DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm
☆25 Updated 2 years ago
google-research / policy-learning-landscape
Explore the optimization landscape for direct policy learning reinforcement learning.
☆51 Updated 6 years ago
nosyndicate / pytorchrl
Deep Reinforcement Learning algorithms implemented in PyTorch
☆49 Updated 7 years ago
DavidJanz / successor_uncertainties_atari
Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…
☆21 Updated 2 years ago
flowersteam / geppg
☆35 Updated 6 years ago
microsoft / logrl
Logarithmic Reinforcement Learning
☆26 Updated 2 years ago
jeappen / gym-grid
A simple Gridworld environment for OpenAI Gym
☆25 Updated 7 years ago
flowersteam / rl-difference-testing
Simple tools for statistical analyses in RL experiments
☆66 Updated 7 years ago
activatedgeek / torchrl
Highly Modular and Scalable Reinforcement Learning
☆115 Updated 5 years ago
Feryal / craft-env
☆44 Updated 6 years ago
Maluuba / srw
Dead-ends and Secure Exploration in Reinforcement Learning
☆11 Updated 6 years ago
RonanFR / UCRL
☆27 Updated 6 years ago
facebookresearch / reward-estimator-corl
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
☆22 Updated 6 years ago
alexis-jacq / LOLA_DiCE
PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
☆95 Updated 6 years ago
EndingCredits / Neural-Episodic-Control
Implementation of DeepMind's Neural Episodic Control
☆58 Updated 7 years ago
facebookresearch / slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆93 Updated 5 years ago
senya-ashukha / quantile-regression-dqn-pytorch
A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning
☆95 Updated 4 years ago
zafarali / emdp
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
☆49 Updated 3 years ago
Akella17 / Deep-Bayesian-Quadrature-Policy-Optimization
Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.
☆16 Updated 4 years ago
sunblaze-ucb / rl-generalization
Modifiable OpenAI Gym environments for studying generalization in RL
☆87 Updated 6 years ago
louaaron / GAN-Q-Learning
Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874
☆47 Updated 4 years ago
Alfo5123 / Robust-Multitask-RL
Machine Learning Course Project Skoltech 2018
☆108 Updated 6 years ago
jvmncs / ParamNoise
A comparison of parameter space noise methods for exploration in deep reinforcement learning
☆28 Updated 6 years ago
DartML / PPO-Stein-Control-Variate
Proximal Policy Optimization with Stein Control Variates
☆33 Updated 7 years ago
facebookresearch / adversarially-motivated-intrinsic-goals
This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".
☆63 Updated last year
seungjaeryanlee / rl-exploration
Reinforcement Learning papers on exploration methods.
☆19 Updated 4 years ago
edbeeching / 3d_control_deep_rl
Baselines and memory-based scenarios for the ViZDoom simulator
☆36 Updated 2 years ago
TomZahavy / GrayingTheBox
Code implementation of: "Graying the black box: Understanding DQNs"
☆20 Updated 8 years ago
koulanurag / mmn
Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks
☆50 Updated 2 years ago
david-abel / rl_abstraction
Code for experimenting with state and action abstractions in reinforcement learning.
☆30 Updated 4 years ago