Efficient Exploration through Bayesian Deep Q-Networks
☆37Feb 14, 2018Updated 8 years ago
Alternatives and similar repositories for BDQN-MxNet-Gluon
Users that are interested in BDQN-MxNet-Gluon are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Efficient Exploration through Bayesian Deep-Q Networks.☆18Mar 22, 2022Updated 4 years ago
- Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networkst"☆14Sep 12, 2022Updated 3 years ago
- This is pytorch implmentation project of Bootsrapped DQN☆13Dec 6, 2020Updated 5 years ago
- Integrate AutoRL into DQN to implement a single traffic signal control system.☆16Nov 16, 2023Updated 2 years ago
- gpbo☆25Jan 18, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Implementation of NeurIPS 2018 paper "Meta-Gradient Reinforcement Learning"☆21Jul 19, 2022Updated 3 years ago
- Tensorflow implementation of BootstrappedDQN using OpenAI baselines☆19Jan 12, 2021Updated 5 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Jul 18, 2025Updated 10 months ago
- OAI Network Service in OSM☆12Sep 13, 2025Updated 8 months ago
- PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.☆15Mar 9, 2017Updated 9 years ago
- [ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement☆132Jun 11, 2019Updated 6 years ago
- Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlat…☆11Dec 9, 2022Updated 3 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The implementation of Discriminator Soft Actor Critic☆15Jan 25, 2020Updated 6 years ago
- Adopting reasonable strategies is challenging but crucial for an intelligent agent with limited resources working in hazardous, unstructu…☆13Dec 28, 2022Updated 3 years ago
- MaxSum is an algorithm about Distributed Constraint Optimization Problems (DCOPs)☆11Jan 15, 2018Updated 8 years ago
- ☆26Jun 14, 2022Updated 3 years ago
- The AI Arena: A framework for distributed multi-agent reinforcement learning☆14Aug 5, 2022Updated 3 years ago
- Pytorch implementation of Planar Flow☆17Dec 2, 2019Updated 6 years ago
- ☆160Jul 21, 2017Updated 8 years ago
- Source code for the Joint Shapley values: a measure of joint feature importance☆12Sep 14, 2021Updated 4 years ago
- Apache Polaris Tools, additional tooling for Apache Polaris☆28Updated this week
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Deep recurrent Q learning on CartPole-v1 environment☆95Jan 15, 2024Updated 2 years ago
- Bayesian Soft Actor Critic☆16Jan 6, 2023Updated 3 years ago
- A beginner's tutorial of reinforcement learning in both Chinese and English. 一份面向初学者的强化学习教程(中英双语)☆12Aug 17, 2023Updated 2 years ago
- The code of the algorithm proposed in the paper "Deep Inverse Reinforcement Learning for Objective Function Identification in Bidding Mod…☆15Aug 13, 2021Updated 4 years ago
- TS_SPMA: The Tabu Search algorithm for simultaneous scheduling problem of machines and AGVs.☆12Apr 30, 2021Updated 5 years ago
- Recorder for Azure Kinect☆14Aug 6, 2020Updated 5 years ago
- 📅 Production-ready scheduler with async, multithreading and multiprocessing support for Python☆22Jul 6, 2024Updated last year
- ☆13Dec 28, 2018Updated 7 years ago
- A portable parser combinator library that does not require a runtime☆13Sep 16, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code publication to the paper "Normalized Attention Without Probability Cage"☆17Nov 9, 2021Updated 4 years ago
- ☆14Mar 5, 2026Updated 2 months ago
- ☆11Oct 19, 2020Updated 5 years ago
- Simple Goal Oriented Action Planning demo written in Javascript with Phaser for studies.☆12Aug 31, 2015Updated 10 years ago
- Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"☆31Nov 22, 2022Updated 3 years ago
- Code for the paper "A Boolean Task Algebra For Reinforcement Learning"☆11Dec 8, 2022Updated 3 years ago
- Simple code for running and visualizing replicator dynamics☆11Jan 31, 2024Updated 2 years ago