uncharted-technologies / risk-and-uncertainty
Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"
☆30 Updated 2 years ago
Alternatives and similar repositories for risk-and-uncertainty:
Users that are interested in risk-and-uncertainty are comparing it to the libraries listed below
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation ☆40 Updated 5 months ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 Updated 5 years ago
- ☆21 Updated 11 months ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL) ☆25 Updated 3 years ago
- ☆32 Updated 7 months ago
- Implementation of Tactical Optimistic and Pessimistic value estimation ☆25 Updated last year
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE ☆55 Updated 2 weeks ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist] ☆37 Updated 2 years ago
- CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning ☆63 Updated 4 years ago
- ☆36 Updated 3 years ago
- On-policy optimization baselines for deep reinforcement learning ☆29 Updated 4 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information. ☆36 Updated 4 years ago
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL… ☆53 Updated 4 years ago
- ☆18 Updated 2 years ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization ☆34 Updated 2 years ago
- ☆24 Updated 7 months ago
- My Body Is A Cage ☆39 Updated 3 years ago
- Disagreement-Regularized Imitation Learning ☆30 Updated 3 years ago
- Safe Policy Improvement with Baseline Bootstrapping ☆26 Updated 4 years ago
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022) ☆27 Updated 3 years ago
- ☆41 Updated 3 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization ☆24 Updated 4 years ago
- Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning" ☆18 Updated 2 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238 ☆44 Updated 4 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP ☆28 Updated 4 years ago
- ☆29 Updated 2 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL" ☆21 Updated 3 years ago
- Toolkit of Causal Model-based Reinforcement Learning. ☆33 Updated last year
- Unofficial re-implementation of "Learning Latent Dynamics for Planning from Pixels" (https://arxiv.org/abs/1811.04551) with PyTorch ☆46 Updated 4 years ago
- Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL ☆23 Updated 4 months ago