Efficient Exploration through Bayesian Deep Q-Networks
☆37Feb 14, 2018Updated 8 years ago
Alternatives and similar repositories for BDQN-MxNet-Gluon
Users that are interested in BDQN-MxNet-Gluon are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Efficient Exploration through Bayesian Deep-Q Networks.☆18Mar 22, 2022Updated 4 years ago
- This is pytorch implmentation project of Bootsrapped DQN☆13Dec 6, 2020Updated 5 years ago
- Integrate AutoRL into DQN to implement a single traffic signal control system.☆16Nov 16, 2023Updated 2 years ago
- gpbo☆25Jan 18, 2021Updated 5 years ago
- Code for Posterior Sampling for Deep Reinforcement Learning, ICML 2023☆28Mar 7, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Tensorflow implementation of BootstrappedDQN using OpenAI baselines☆19Jan 12, 2021Updated 5 years ago
- RainBow, Tensorflow☆49Mar 28, 2018Updated 8 years ago
- PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.☆15Mar 9, 2017Updated 9 years ago
- [ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement☆132Jun 11, 2019Updated 6 years ago
- Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlat…☆11Dec 9, 2022Updated 3 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- 明朝那些事儿☆10May 31, 2022Updated 3 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- Adopting reasonable strategies is challenging but crucial for an intelligent agent with limited resources working in hazardous, unstructu…☆13Dec 28, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- MaxSum is an algorithm about Distributed Constraint Optimization Problems (DCOPs)☆11Jan 15, 2018Updated 8 years ago
- ☆12Sep 30, 2017Updated 8 years ago
- Minimal end-to-end LTE using srsRAN. Dockerized and emulated radio over shared memory.☆11Jun 7, 2021Updated 4 years ago
- The AI Arena: A framework for distributed multi-agent reinforcement learning☆14Aug 5, 2022Updated 3 years ago
- Code for paper "Episodic Memory Deep Q-Networks" (https://arxiv.org/abs/1805.07603), IJCAI 2018☆63Sep 5, 2018Updated 7 years ago
- Pytorch implementation of Planar Flow☆17Dec 2, 2019Updated 6 years ago
- ☆160Jul 21, 2017Updated 8 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- Variational Auto-Regressive Gaussian Processes for Continual Learning☆22Jun 15, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The code of the algorithm proposed in the paper "Deep Inverse Reinforcement Learning for Objective Function Identification in Bidding Mod…☆15Aug 13, 2021Updated 4 years ago
- TS_SPMA: The Tabu Search algorithm for simultaneous scheduling problem of machines and AGVs.☆12Apr 30, 2021Updated 5 years ago
- Code for our GECCO 2021 paper : A Coevolutionary Approach to Deep Multi-agent Reinforcement Learning☆15Oct 4, 2021Updated 4 years ago
- Multi-agent reinforcement learning framework☆36Aug 13, 2020Updated 5 years ago
- Recorder for Azure Kinect☆14Aug 6, 2020Updated 5 years ago
- ☆13Dec 28, 2018Updated 7 years ago
- A portable parser combinator library that does not require a runtime☆13Sep 16, 2019Updated 6 years ago
- 基于insightface训练mobilefacenet的调试步骤,更改模型后一层训练结果为99.683% in lfw and 96.717 in agedb. Now pls move to the new mobilefacenet-V2…☆11Aug 28, 2018Updated 7 years ago
- Prediction Intervals: Split Normal Mixture from Quality-Driven Deep Ensembles. Published at Uncertainty in AI (UAI) 2020.☆11Aug 31, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆11Oct 19, 2020Updated 5 years ago
- ☆16May 20, 2025Updated 11 months ago
- Code for the paper "A Boolean Task Algebra For Reinforcement Learning"☆11Dec 8, 2022Updated 3 years ago
- Code associated with our paper "Estimating Risk and Uncertainty in Reinforcement Learning"☆11Oct 3, 2023Updated 2 years ago
- Analysing ML conference data and plotting interesting statistics.☆11Aug 4, 2023Updated 2 years ago
- Binary Neural Network on IceStick FPGA.☆55Jul 11, 2018Updated 7 years ago
- Code for the CoRL 2019 paper AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers☆24Feb 15, 2023Updated 3 years ago