Efficient Exploration through Bayesian Deep-Q Networks.
☆18Mar 22, 2022Updated 4 years ago
Alternatives and similar repositories for BDQN-PyTorch
Users who are interested in BDQN-PyTorch are comparing it to the libraries listed below.
- Efficient Exploration through Bayesian Deep Q-Networks☆38Feb 14, 2018Updated 8 years ago
- Bayesian Soft Actor Critic☆16Jan 6, 2023Updated 3 years ago
- ☆12Sep 30, 2017Updated 8 years ago
- Adopting reasonable strategies is challenging but crucial for an intelligent agent with limited resources working in hazardous, unstructu…☆13Dec 28, 2022Updated 3 years ago
- Use of reinforcement learning and deep reinforcement learning algorithms to optimize the UAV based cellular network for higher throughput…☆26Mar 5, 2026Updated 3 months ago
- ☆19Mar 5, 2018Updated 8 years ago
- Regularized Learning under label shifts☆18May 1, 2019Updated 7 years ago
- [NeurIPS 2025 Spotlight] "Stochastic Process Learning via Operator Flow Matching"☆22Apr 26, 2026Updated last month
- [ICML 2022] Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum☆12Jul 15, 2022Updated 3 years ago
- Deep Reinforcement Learning and BCD to solve phase shift and resource allocation of RIS and RSU☆32Jan 18, 2021Updated 5 years ago
- A beginner's tutorial of reinforcement learning in both Chinese and English. 一份面向初学者的强化学习教程(中英双语)☆12Aug 17, 2023Updated 2 years ago
- Supporting code for "Learning to Solve Combinatorial Graph Partitioning Problems via Efficient Exploration".☆13Jun 18, 2022Updated 3 years ago
- A portable parser combinator library that does not require a runtime☆13Sep 16, 2019Updated 6 years ago
- [ICLR 2025] UniCO: On Unified Combinatorial Optimization via Problem Reduction to Matrix-Encoded General TSP☆17Jun 20, 2025Updated 11 months ago
- Exploring Deep Reinforcement Learning-Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT☆37Mar 25, 2024Updated 2 years ago
- Train text generation model with JavaScript.☆15Jul 14, 2024Updated last year
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆30Dec 15, 2025Updated 5 months ago
- Code related to my NIPS 2016 paper☆10Dec 4, 2016Updated 9 years ago
- ☆10Feb 22, 2023Updated 3 years ago
- ☆11Oct 19, 2020Updated 5 years ago
- Research on fee markets and resource allocation in blockchains.☆10Mar 14, 2023Updated 3 years ago
- ☆16May 20, 2025Updated last year
- Code associated with our paper "Estimating Risk and Uncertainty in Reinforcement Learning"☆11Oct 3, 2023Updated 2 years ago
- ☆13Nov 28, 2023Updated 2 years ago
- ☆35May 24, 2023Updated 3 years ago
- Implementing DQNClipped and DQNReg Algorithms☆10Mar 2, 2021Updated 5 years ago
- Code for the CoRL 2019 paper AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers☆24Feb 15, 2023Updated 3 years ago
- Q learning and DQN☆10Mar 14, 2022Updated 4 years ago
- This repository contains all code and experiments for competitive policy gradient (CoPG) algorithm.☆24Aug 1, 2020Updated 5 years ago
- Source code for ICML 2023 paper "Competing for Shareable Arms in Multi-Player Multi-Armed Bandits"☆10May 14, 2024Updated 2 years ago
- ☆10Sep 21, 2020Updated 5 years ago
- ☆17Apr 14, 2024Updated 2 years ago
- Dynamic Task Software Caching-Assisted Computation Offloading for Multi-Access Edge Computing☆11Dec 18, 2022Updated 3 years ago
- ☆12Jan 6, 2022Updated 4 years ago
- Rust implementation of a basic SPICE simulator☆11May 30, 2023Updated 3 years ago
- An LLM-powered agent for NetHack☆23Nov 4, 2024Updated last year
- [TPAMI] "Symbolic Visual Reinforcement Learning: A Scalable Framework with Object-Level Abstraction and Differentiable Expression Search"…☆18Jan 4, 2023Updated 3 years ago
- ☆11Jun 29, 2021Updated 4 years ago
- Code of Paper "Cooperative Sensing and Uploading for Quality-Cost Tradeoff of Digital Twins in VEC", IEEE TCE, 2024.☆12Jul 10, 2023Updated 2 years ago