Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"
☆31Nov 22, 2022Updated 3 years ago
Alternatives and similar repositories for risk-and-uncertainty
Users that are interested in risk-and-uncertainty are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code associated with our paper "Estimating Risk and Uncertainty in Reinforcement Learning"☆11Oct 3, 2023Updated 2 years ago
- Convergent Policy Optimization for Safe Reinforcement Learning☆11Oct 26, 2019Updated 6 years ago
- ☆11Oct 19, 2020Updated 5 years ago
- Implicit Distributional Actor Critic☆11Dec 8, 2021Updated 4 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Repository for studying distributional rl☆30Feb 2, 2025Updated last year
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆59Aug 4, 2022Updated 3 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- A dataloader, but for JAX☆20May 17, 2024Updated last year
- ☆12Mar 15, 2022Updated 4 years ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆37Jan 24, 2026Updated 3 months ago
- ☆14Jun 11, 2024Updated last year
- This repository is the official implementation of Learning Multi-Agent Coordination for Enhancing Target Coverage in Directional Sensor N…☆52Nov 27, 2020Updated 5 years ago
- Github Repo for CARL: Cautious Adaptation for RL in Safety Critical Settings☆14Nov 22, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- SGToolkit: An Interactive Gesture Authoring Toolkit for Embodied Conversational Agents (UIST 2021)☆45Sep 14, 2022Updated 3 years ago
- Reinforcement Learning implementations and research prototyping in TensorFlow☆81Apr 28, 2019Updated 7 years ago
- ☆20Jul 9, 2025Updated 9 months ago
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 3 years ago
- IROS 2018 Software Tutorial on XBotControl☆10Oct 16, 2019Updated 6 years ago
- ☆18Apr 17, 2026Updated last week
- Vectorization techniques for fast population-based training.☆57Aug 12, 2022Updated 3 years ago
- ☆11Oct 24, 2023Updated 2 years ago
- ☆35Jul 10, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This is pytorch implmentation project of Bootsrapped DQN☆13Dec 6, 2020Updated 5 years ago
- ☆16Nov 27, 2016Updated 9 years ago
- Factored model-based Bayesian Reinforcement Learning framework☆21Nov 23, 2022Updated 3 years ago
- A beginner's tutorial of reinforcement learning in both Chinese and English. 一份面向初学者的强化学习教程(中英双语)☆11Aug 17, 2023Updated 2 years ago
- ☆55Jan 20, 2023Updated 3 years ago
- ☆15Feb 18, 2020Updated 6 years ago
- A lightweight reimplementation of Adversarially Trained Actor Critic☆20Mar 19, 2026Updated last month
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- ☆27Oct 25, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Different implementations of Bayesian neural networks for uncertainty estimation. The uncertainty estimation is utilized for efficient ex…☆11Nov 29, 2020Updated 5 years ago
- Code for the paper "PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning" (NeurIPS 2023)☆22Dec 8, 2023Updated 2 years ago
- ☆25Apr 16, 2024Updated 2 years ago
- ☆15May 20, 2025Updated 11 months ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Mar 1, 2023Updated 3 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago