kazizzad / BDQN-MxNet-GluonView external linksLinks
Efficient Exploration through Bayesian Deep Q-Networks
☆37Feb 14, 2018Updated 7 years ago
Alternatives and similar repositories for BDQN-MxNet-Gluon
Users that are interested in BDQN-MxNet-Gluon are comparing it to the libraries listed below
Sorting:
- ☆13Dec 28, 2018Updated 7 years ago
- gpbo☆25Jan 18, 2021Updated 5 years ago
- 基于insightface训练mobilefacenet的调试步骤,更改模型后一层训练结果为99.683% in lfw and 96.717 in agedb. Now pls move to the new mobilefacenet-V2…☆11Aug 28, 2018Updated 7 years ago
- ☆12Sep 30, 2017Updated 8 years ago
- Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networkst"☆14Sep 12, 2022Updated 3 years ago
- Code publication to the paper "Normalized Attention Without Probability Cage"☆17Nov 9, 2021Updated 4 years ago
- PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.☆15Mar 9, 2017Updated 8 years ago
- The implementation of Discriminator Soft Actor Critic☆15Jan 25, 2020Updated 6 years ago
- Pytorch implementation of Planar Flow☆17Dec 2, 2019Updated 6 years ago
- Deep exponential family models in MXNet/Gluon. Layers o' latents 💤☆17Oct 16, 2017Updated 8 years ago
- Variational Auto-Regressive Gaussian Processes for Continual Learning☆22Jun 15, 2021Updated 4 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 2 years ago
- Apache MXNet Site☆22Apr 24, 2025Updated 9 months ago
- Mxnet version of https://github.com/ZheC/Realtime_Multi-Person_Pose_Estimation☆25Jan 29, 2020Updated 6 years ago
- DQN-MxNet-Gluon☆23Nov 12, 2017Updated 8 years ago
- This repository contains the Julia code for the paper "Competitive Gradient Descent"☆25Dec 18, 2019Updated 6 years ago
- ☆26Jun 14, 2022Updated 3 years ago
- ☆160Jul 21, 2017Updated 8 years ago
- Deeplab for semantic segmentation implemented by MXNet☆22Mar 5, 2017Updated 8 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- Charades Object Detection Dataset (ICCV 2017)☆31May 30, 2018Updated 7 years ago
- ☆31Jul 1, 2019Updated 6 years ago
- Depth_conv for MobileNet☆30Jun 22, 2020Updated 5 years ago
- SIR, SEIR, and beyond☆10Jul 6, 2023Updated 2 years ago
- Hierarchical Deep RL Network☆31Feb 20, 2017Updated 8 years ago
- Detect and reconstruct transparent objects from scan shadows☆10Sep 22, 2017Updated 8 years ago
- MXNet implementation of CapsNet☆29Nov 29, 2017Updated 8 years ago
- Given the previous frames of the video as input, we want to get the long-term frame prediction.☆32Aug 28, 2017Updated 8 years ago
- Recurrent Neural Networks with External Memory☆30Aug 15, 2015Updated 10 years ago
- blender based random procedural object generation for bullet grasping☆39Sep 27, 2019Updated 6 years ago
- Prediction Intervals: Split Normal Mixture from Quality-Driven Deep Ensembles. Published at Uncertainty in AI (UAI) 2020.☆11Aug 31, 2020Updated 5 years ago
- Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlat…☆11Dec 9, 2022Updated 3 years ago
- ☆11Jan 6, 2022Updated 4 years ago
- Find Functions and their Dependencies☆13Feb 21, 2022Updated 3 years ago
- College project about article http://www.cs.ust.hk/~quan/publications/yuan-deblur-siggraph07.pdf☆10Jan 25, 2013Updated 13 years ago
- Terminal user interface for a Kanban board☆11Nov 5, 2021Updated 4 years ago
- Tools for ML/MXNet on Kubernetes.☆44Feb 11, 2018Updated 8 years ago
- ☆36Aug 10, 2018Updated 7 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model☆154Oct 26, 2020Updated 5 years ago