kazizzad/BDQN-MxNet-Gluon

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kazizzad/BDQN-MxNet-Gluon)

kazizzad / BDQN-MxNet-Gluon

Efficient Exploration through Bayesian Deep Q-Networks

☆38

Alternatives and similar repositories for BDQN-MxNet-Gluon

Users that are interested in BDQN-MxNet-Gluon are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JoungheeKim / bootsrapped-dqn
View on GitHub
This is pytorch implmentation project of Bootsrapped DQN
☆13Dec 6, 2020Updated 5 years ago
IandRover / meta_gradient_RL
View on GitHub
Implementation of NeurIPS 2018 paper "Meta-Gradient Reinforcement Learning"
☆21Jul 19, 2022Updated 4 years ago
markm541374 / gpbo
View on GitHub
gpbo
☆25Jan 18, 2021Updated 5 years ago
rrmenon10 / Bootstrapped-DQN
View on GitHub
Tensorflow implementation of BootstrappedDQN using OpenAI baselines
☆19Jan 12, 2021Updated 5 years ago
montrealrobotics / iv_rl
View on GitHub
IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
☆40Jul 18, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
cmusjtuliuyuan / RainBow
View on GitHub
RainBow, Tensorflow
☆49Mar 28, 2018Updated 8 years ago
pathak22 / exploration-by-disagreement
View on GitHub
[ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement
☆131Jun 11, 2019Updated 7 years ago
gkswamy98 / causal_il
View on GitHub
Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlat…
☆11Dec 9, 2022Updated 3 years ago
abhishm / PGQ
View on GitHub
PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.
☆15Mar 9, 2017Updated 9 years ago
bonniesjli / DQN_SR
View on GitHub
Count based exploration with the successor representation for Unity ML's Pyramid
☆12Jun 19, 2019Updated 7 years ago
DavidJanz / successor_uncertainties_atari
View on GitHub
Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…
☆21Feb 24, 2023Updated 3 years ago
RickYang2016 / Bayesian-Soft-Actor-Critic
View on GitHub
Adopting reasonable strategies is challenging but crucial for an intelligent agent with limited resources working in hazardous, unstructu…
☆13Dec 28, 2022Updated 3 years ago
andry91 / Max_Sum_Python
View on GitHub
MaxSum is an algorithm about Distributed Constraint Optimization Problems (DCOPs)
☆11Jan 15, 2018Updated 8 years ago
dnishio / DSAC
View on GitHub
The implementation of Discriminator Soft Actor Critic
☆15Jan 25, 2020Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
kazizzad / DCGAN-Gluon-MxNet
View on GitHub
☆12Sep 30, 2017Updated 8 years ago
tfolkman / deep-learning-experiments
View on GitHub
Deep Learning Experiments Motivated from Fastai Course
☆14Jan 2, 2019Updated 7 years ago
sparkmxy / my-offlinerl
View on GitHub
☆26Jun 14, 2022Updated 4 years ago
DartML / PPO-Stein-Control-Variate
View on GitHub
Proximal Policy Optimization with Stein Control Variates:
☆33Feb 12, 2018Updated 8 years ago
LinZichuan / emdqn
View on GitHub
Code for paper "Episodic Memory Deep Q-Networks" (https://arxiv.org/abs/1805.07603), IJCAI 2018
☆63Sep 5, 2018Updated 7 years ago
rlbayes / rllabplusplus
View on GitHub
☆162Jul 21, 2017Updated 9 years ago
zcchenvy / CIL-DDQN
View on GitHub
code of paper 《Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control Problem》
☆17Dec 14, 2020Updated 5 years ago
HongyeGuo / DIRL-bidding_preference
View on GitHub
The code of the algorithm proposed in the paper "Deep Inverse Reinforcement Learning for Objective Function Identification in Bidding Mod…
☆15Aug 13, 2021Updated 4 years ago
uber-research / vargp
View on GitHub
Variational Auto-Regressive Gaussian Processes for Continual Learning
☆22Jun 15, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
shoda888 / AKRecorder
View on GitHub
Recorder for Azure Kinect
☆14Aug 6, 2020Updated 5 years ago
nklein23 / mxnet-with-R
View on GitHub
☆13Dec 28, 2018Updated 7 years ago
atomicptr / goap
View on GitHub
Simple Goal Oriented Action Planning demo written in Javascript with Phaser for studies.
☆12Aug 31, 2015Updated 10 years ago
feli-s / algorithm-selector-for-AGV-scheduling
View on GitHub
This repository stores code about the JSSP and FJSSP scheduling problem solved with two constraint programming solvers: IBM CPLEX CP Opti…
☆15Dec 15, 2022Updated 3 years ago
adam-mcdaniel / honeycomb
View on GitHub
A portable parser combinator library that does not require a runtime
☆13Sep 16, 2019Updated 6 years ago
zwfightzw / Meta-Critic
View on GitHub
☆11Oct 19, 2020Updated 5 years ago
rahulrahaman / Uncertainty-Quantification-and-Deep-Ensemble
View on GitHub
Experiments from our work Uncertainty Quantification and Deep Ensemble
☆10Nov 1, 2021Updated 4 years ago
tarik / pi-snm-qde
View on GitHub
Prediction Intervals: Split Normal Mixture from Quality-Driven Deep Ensembles. Published at Uncertainty in AI (UAI) 2020.
☆11Aug 31, 2020Updated 5 years ago
rickwierenga / opentrons-python-api
View on GitHub
Simple (and currently incomplete) Python wrapper around the Opentrons HTTP API
☆11Nov 26, 2025Updated 7 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
uncharted-technologies / risk-and-uncertainty
View on GitHub
Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"
☆31Nov 22, 2022Updated 3 years ago
SCP-CN-001 / RL101
View on GitHub
A beginner's tutorial of reinforcement learning in both Chinese and English. 一份面向初学者的强化学习教程（中英双语）
☆13Aug 17, 2023Updated 2 years ago
sinairv / GridSoccerSimulator
View on GitHub
A multi-agent soccer simulator in a grid-world environment, with agents implementing different reinforcement learning algorithms
☆13Jun 4, 2017Updated 9 years ago
IndustAI / risk-and-uncertainty
View on GitHub
Code associated with our paper "Estimating Risk and Uncertainty in Reinforcement Learning"
☆11Oct 3, 2023Updated 2 years ago
geraudnt / boolean_composition
View on GitHub
Code for the paper "A Boolean Task Algebra For Reinforcement Learning"
☆11Dec 8, 2022Updated 3 years ago
jachiam / surprise
View on GitHub
Surprise-based intrinsic motivation for deep reinforcement learning
☆21Mar 6, 2017Updated 9 years ago
AurelianTactics / dqnclipped_dqnreg_prelim_implementation
View on GitHub
Implementing DQNClipped and DQNReg Algorithms
☆10Mar 2, 2021Updated 5 years ago