smsxgz / oh-my-q-learningLinks
Our implementation of the Q-learning algorithms by tensorflow or pytorch. @smsxgz @yangwenhaosms @hzxsnczpku
☆8Updated 6 years ago
Alternatives and similar repositories for oh-my-q-learning
Users that are interested in oh-my-q-learning are comparing it to the libraries listed below
Sorting:
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"☆11Updated 6 years ago
- CFG-GAN: Composite functional gradient learning of generative adversarial models☆15Updated 5 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆152Updated 7 years ago
- Stein Variational Policy Gradient for REINFORCE☆18Updated 8 years ago
- Repository for our ICML 2019 paper: Curiosity-Bottleneck☆34Updated 2 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Updated 5 years ago
- ☆111Updated 5 years ago
- ☆67Updated 4 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 5 years ago
- Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization☆33Updated 4 years ago
- ☆17Updated 3 years ago
- ☆27Updated 6 years ago
- Exploration based Reinforcement Learning. (Montezuma Revenge)☆14Updated 6 years ago
- ☆11Updated 5 years ago
- Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)☆20Updated 4 years ago
- Upper Confidence Tree Planner for ATARI games☆19Updated 9 years ago
- NIPS 2017 Value Prediction Network☆166Updated 7 years ago
- ☆43Updated 6 years ago
- P3O paper code☆29Updated 5 years ago
- code for "Quantile Stein Variational Gradient Descent"☆9Updated 6 years ago
- ☆61Updated 7 years ago
- Pytorch implementation of KFAC and E-KFAC (Natural Gradient).☆132Updated 6 years ago
- A tensorflow implementation of the NIPS 2018 paper "Variational Inference with Tail-adaptive f-Divergence"☆21Updated 6 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 7 years ago
- Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020☆54Updated 5 years ago
- homework for CS294 Fall 2017☆167Updated 7 years ago
- Stochastic Neural Networks for Hierarchical Reinforcement Learning☆96Updated 7 years ago
- ☆43Updated 8 years ago
- Code for "Detecting Adversarial Attacks on Neural Network Policies with Visual Foresight"☆79Updated 7 years ago
- Code for paper Causal Confusion in Imitation Learning☆45Updated 5 years ago