Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"
☆31Nov 22, 2022Updated 3 years ago
Alternatives and similar repositories for risk-and-uncertainty
Users who are interested in risk-and-uncertainty are comparing it to the libraries listed below.
- Code associated with our paper "Estimating Risk and Uncertainty in Reinforcement Learning"☆11Oct 3, 2023Updated 2 years ago
- Convergent Policy Optimization for Safe Reinforcement Learning☆11Oct 26, 2019Updated 6 years ago
- ☆11Oct 19, 2020Updated 5 years ago
- ☆17Oct 14, 2021Updated 4 years ago
- Archiving everything we have studied☆10Jan 23, 2022Updated 4 years ago
- Implicit Distributional Actor Critic☆11Dec 8, 2021Updated 4 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Repository for studying distributional RL☆30Feb 2, 2025Updated last year
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆59Aug 4, 2022Updated 3 years ago
- [RA-L 2024] Novel action spaces leveraging redundancy in 7 DoF arms enable efficient & precise learning in robotic manipulation☆21Jun 6, 2024Updated last year
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- A dataloader, but for JAX☆20May 17, 2024Updated 2 years ago
- Deep Reinforcement Learning book https://hiddenbeginner.github.io/Deep-Reinforcement-Learnings ☆11May 10, 2024Updated 2 years ago
- ☆12Mar 15, 2022Updated 4 years ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆36Jan 24, 2026Updated 3 months ago
- ☆14Jun 11, 2024Updated last year
- This repository is the official implementation of Learning Multi-Agent Coordination for Enhancing Target Coverage in Directional Sensor N…☆52Nov 27, 2020Updated 5 years ago
- Reinforcement Learning implementations and research prototyping in TensorFlow☆81Apr 28, 2019Updated 7 years ago
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 3 years ago
- IROS 2018 Software Tutorial on XBotControl☆10Oct 16, 2019Updated 6 years ago
- ☆18Apr 17, 2026Updated last month
- Vectorization techniques for fast population-based training.☆57Apr 26, 2026Updated 3 weeks ago
- ☆35Jul 10, 2021Updated 4 years ago
- A PyTorch implementation of Bootstrapped DQN☆13Dec 6, 2020Updated 5 years ago
- ☆16Nov 27, 2016Updated 9 years ago
- Factored model-based Bayesian Reinforcement Learning framework☆21Nov 23, 2022Updated 3 years ago
- A beginner's tutorial on reinforcement learning in both Chinese and English☆11Aug 17, 2023Updated 2 years ago
- ☆56Jan 20, 2023Updated 3 years ago
- ☆15Feb 18, 2020Updated 6 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- ☆27Oct 25, 2019Updated 6 years ago
- Jax implementation of "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆15May 10, 2024Updated 2 years ago
- Code for the paper "PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning" (NeurIPS 2023)☆22Dec 8, 2023Updated 2 years ago
- ☆25Apr 16, 2024Updated 2 years ago
- ☆16May 20, 2025Updated last year
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning☆14Dec 1, 2019Updated 6 years ago
- A gym interface for AI safety gridworlds created in pycolab.☆18May 12, 2022Updated 4 years ago
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Mar 1, 2023Updated 3 years ago
- Code repo for "Collapsing Bandits and Their Applications to Public Health Interventions", (NeurIPS'20)☆10Dec 3, 2025Updated 5 months ago