Efficient Exploration through Bayesian Deep Q-Networks
☆37Feb 14, 2018Updated 8 years ago
Alternatives and similar repositories for BDQN-MxNet-Gluon
Users that are interested in BDQN-MxNet-Gluon are comparing it to the libraries listed below
Sorting:
- ☆13Dec 28, 2018Updated 7 years ago
- gpbo☆25Jan 18, 2021Updated 5 years ago
- 基于insightface训练mobilefacenet的调试步骤,更改模型后一层训练结果为99.683% in lfw and 96.717 in agedb. Now pls move to the new mobilefacenet-V2…☆11Aug 28, 2018Updated 7 years ago
- ☆12Sep 30, 2017Updated 8 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- This is pytorch implmentation project of Bootsrapped DQN☆13Dec 6, 2020Updated 5 years ago
- Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networkst"☆14Sep 12, 2022Updated 3 years ago
- Code publication to the paper "Normalized Attention Without Probability Cage"☆17Nov 9, 2021Updated 4 years ago
- [ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement☆129Jun 11, 2019Updated 6 years ago
- The implementation of Discriminator Soft Actor Critic☆15Jan 25, 2020Updated 6 years ago
- PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.☆15Mar 9, 2017Updated 8 years ago
- Pytorch implementation of Planar Flow☆17Dec 2, 2019Updated 6 years ago
- Deep exponential family models in MXNet/Gluon. Layers o' latents 💤☆17Oct 16, 2017Updated 8 years ago
- Tensorflow implementation of BootstrappedDQN using OpenAI baselines☆19Jan 12, 2021Updated 5 years ago
- Variational Auto-Regressive Gaussian Processes for Continual Learning☆22Jun 15, 2021Updated 4 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- Implementation of the Self Paced Reinforcement Learning Experiments☆19Sep 27, 2023Updated 2 years ago
- DQN-MxNet-Gluon☆23Nov 12, 2017Updated 8 years ago
- Mxnet version of https://github.com/ZheC/Realtime_Multi-Person_Pose_Estimation☆25Jan 29, 2020Updated 6 years ago
- ☆26Jun 14, 2022Updated 3 years ago
- ☆160Jul 21, 2017Updated 8 years ago
- Code for paper "Episodic Memory Deep Q-Networks" (https://arxiv.org/abs/1805.07603), IJCAI 2018☆62Sep 5, 2018Updated 7 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"☆31Nov 22, 2022Updated 3 years ago
- Charades Object Detection Dataset (ICCV 2017)☆31May 30, 2018Updated 7 years ago
- ☆31Jul 1, 2019Updated 6 years ago
- Depth_conv for MobileNet☆30Jun 22, 2020Updated 5 years ago
- Given the previous frames of the video as input, we want to get the long-term frame prediction.☆32Aug 28, 2017Updated 8 years ago
- Hierarchical Deep RL Network☆31Feb 20, 2017Updated 9 years ago
- ☆15May 20, 2025Updated 9 months ago
- SIR, SEIR, and beyond☆10Jul 6, 2023Updated 2 years ago
- A set of competitive environments for Reinforcement Learning research.☆30Dec 1, 2022Updated 3 years ago
- Learning Detection with Diverse Proposals at CVPR 2017☆31May 24, 2017Updated 8 years ago
- Find Functions and their Dependencies☆13Feb 21, 2022Updated 4 years ago
- Prediction Intervals: Split Normal Mixture from Quality-Driven Deep Ensembles. Published at Uncertainty in AI (UAI) 2020.☆11Aug 31, 2020Updated 5 years ago
- Code for the papers "Induction of Subgoal Automata for Reinforcement Learning" (AAAI-20) and "Induction and Exploitation of Subgoal Autom…☆13Aug 15, 2023Updated 2 years ago
- Terminal user interface for a Kanban board☆11Nov 5, 2021Updated 4 years ago
- Accelerator Zoo☆20Oct 14, 2025Updated 4 months ago
- College project about article http://www.cs.ust.hk/~quan/publications/yuan-deblur-siggraph07.pdf☆10Jan 25, 2013Updated 13 years ago