smsxgz / oh-my-q-learning

Our implementation of Q-learning algorithms in TensorFlow or PyTorch. @smsxgz @yangwenhaosms @hzxsnczpku

☆8

Alternatives and similar repositories for oh-my-q-learning

Users who are interested in oh-my-q-learning are comparing it to the repositories listed below.

KAIST-AILab / gmmil
Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"
☆11 Updated 6 years ago
riejohnson / cfg-gan
CFG-GAN: Composite functional gradient learning of generative adversarial models
☆15 Updated 5 years ago
Breakend / DeepReinforcementLearningThatMatters
Accompanying code for "Deep Reinforcement Learning that Matters"
☆152 Updated 7 years ago
largelymfs / svpg_REINFORCE
Stein Variational Policy Gradient for REINFORCE
☆18 Updated 8 years ago
whyjay / curiosity-bottleneck
Repository for our ICML 2019 paper: Curiosity-Bottleneck
☆34 Updated 2 years ago
facebookresearch / slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆93 Updated 5 years ago
implementation-matters / code-for-paper
☆111 Updated 5 years ago
cle-ros / RoutingNetworks
☆67 Updated 4 years ago
lns / dapo
Source code for the paper "Divergence-Augmented Policy Optimization"
☆37 Updated 5 years ago
kaixin96 / mixreg
Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization
☆33 Updated 4 years ago
gnobitab / MultiObjectiveSampling
☆17 Updated 3 years ago
wendazhou / nnet-compression-generalization
☆27 Updated 6 years ago
NoListen / ERL
Exploration based Reinforcement Learning. (Montezuma Revenge)
☆14 Updated 6 years ago
roosephu / boots
☆11 Updated 5 years ago
tgangwani / SelfImitationDiverse
Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)
☆20 Updated 4 years ago
XiaoxiaoGuo / atari_uct
Upper Confidence Tree Planner for ATARI games
☆19 Updated 9 years ago
junhyukoh / value-prediction-network
NIPS 2017 Value Prediction Network
☆166 Updated 7 years ago
russellmendonca / maesn_suite
☆43 Updated 6 years ago
rasoolfa / P3O
P3O paper code
☆29 Updated 5 years ago
ChengyueGongR / QSVGD
code for "Quantile Stein Variational Gradient Descent"
☆9 Updated 6 years ago
Hwhitetooth / lirpg
☆61 Updated 7 years ago
alecwangcq / KFAC-Pytorch
Pytorch implementation of KFAC and E-KFAC (Natural Gradient).
☆132 Updated 6 years ago
dilinwang820 / adaptive-f-divergence
A tensorflow implementation of the NIPS 2018 paper "Variational Inference with Tail-adaptive f-Divergence"
☆21 Updated 6 years ago
jeanharb / a2oc_delib
A3C style Option-Critic with deliberation cost
☆39 Updated 7 years ago
pokaxpoka / netrand
Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020
☆54 Updated 5 years ago
Observerspy / CS294
homework for CS294 Fall 2017
☆167 Updated 7 years ago
florensacc / snn4hrl
Stochastic Neural Networks for Hierarchical Reinforcement Learning
☆96 Updated 7 years ago
mcmachado / options
☆43 Updated 8 years ago
yenchenlin / rl-attack-detection
Code for "Detecting Adversarial Attacks on Neural Network Policies with Visual Foresight"
☆79 Updated 7 years ago
pimdh / causal-confusion
Code for paper Causal Confusion in Imitation Learning
☆45 Updated 5 years ago