IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
☆40Jul 18, 2025Updated 8 months ago
Alternatives and similar repositories for iv_rl
Users that are interested in iv_rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)☆12Nov 28, 2023Updated 2 years ago
- Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networkst"☆14Sep 12, 2022Updated 3 years ago
- This is pytorch implmentation project of Bootsrapped DQN☆13Dec 6, 2020Updated 5 years ago
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning☆15Jul 23, 2021Updated 4 years ago
- Integrate AutoRL into DQN to implement a single traffic signal control system.☆16Nov 16, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Pytorch implementation of "Succinct and Robust Multi-Agent Communication With Temporal Message Control"☆28Dec 6, 2020Updated 5 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆29Feb 21, 2022Updated 4 years ago
- Implementation of NeurIPS 2018 paper "Meta-Gradient Reinforcement Learning"☆22Jul 19, 2022Updated 3 years ago
- Repository for Deep Active Localization research and benchmarks☆36May 12, 2020Updated 5 years ago
- ☆35Jul 10, 2021Updated 4 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆28Dec 7, 2021Updated 4 years ago
- ☆15Apr 5, 2023Updated 2 years ago
- Model-based reinforcement learning using CEM, MPC and PETS☆16Nov 20, 2019Updated 6 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Jul 18, 2023Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Episodic Control☆22Sep 20, 2022Updated 3 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Aug 2, 2018Updated 7 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Jan 16, 2019Updated 7 years ago
- ☆12Mar 17, 2024Updated 2 years ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL☆29Oct 29, 2023Updated 2 years ago
- MELD: Meta-Reinforcement Learning from Images via Latent State Models https://arxiv.org/abs/2010.13957☆67Apr 30, 2021Updated 4 years ago
- Code for Posterior Sampling for Deep Reinforcement Learning, ICML 2023☆28Mar 7, 2024Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆11May 5, 2020Updated 5 years ago
- Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)☆12Jul 7, 2021Updated 4 years ago
- Model-Based Uncertainty in Value Functions (AISTATS2023)☆16Feb 28, 2023Updated 3 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Feb 14, 2018Updated 8 years ago
- ☆17Jul 3, 2017Updated 8 years ago
- Code for "Structured Sparsity Inducing Adaptive Optimizers for Deep Learning" in PyTorch☆18Feb 11, 2021Updated 5 years ago
- Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021☆36Jul 6, 2022Updated 3 years ago
- implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies☆13Mar 9, 2021Updated 5 years ago
- Energy-based Surprise Minimization for Multi-Agent Value Factorization☆12Oct 20, 2023Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".☆87Mar 22, 2024Updated 2 years ago
- Implementing different learning algorithms and analyzing their performance in a Markov game model called the Soccer Game☆23Jan 29, 2023Updated 3 years ago
- ☆26Apr 26, 2024Updated last year
- Official implementation of A Mixture of Surprises for Unsupervised Reinforcement Learning☆23Nov 16, 2022Updated 3 years ago
- Representation Learning in RL☆13Jun 1, 2022Updated 3 years ago
- Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)☆23Jul 16, 2022Updated 3 years ago
- Implementation of NeurIPS2021 paper <On Effective Scheduling of Model-based Reinforcement Learning>☆13Nov 16, 2021Updated 4 years ago