Code for the paper "AlwaysSafe: Reinforcement Learning Without Safety Constraint Violations During Training"
☆17May 9, 2022Updated 4 years ago
Alternatives and similar repositories for AlwaysSafe
Users that are interested in AlwaysSafe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆67Aug 3, 2023Updated 2 years ago
- Bayes-Adaptive Monte-Carlo Planning algorithm☆19Mar 5, 2013Updated 13 years ago
- Source code for journal paper "Multiagent Reinforcement Learning With Sparse Interactions by Negotiation and Knowledge Transfer"☆13Dec 26, 2017Updated 8 years ago
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Mar 1, 2023Updated 3 years ago
- Probabilistic planning in continuous state-action MDPs in TensorFlow.☆13Jun 21, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆48Dec 8, 2022Updated 3 years ago
- Leave No Trace is an algorithm for safe reinforcement learning.☆15Apr 30, 2018Updated 8 years ago
- Apply safe RL methods from safety-starter-agents in highway-env☆14Jun 28, 2021Updated 4 years ago
- The Laser Learning Environment (LLE) is a cooperative MARL grid-world☆13Updated this week
- Simple gym environments for safety in Reinforcement Learning Research☆18Jul 17, 2024Updated last year
- Executive control code for STRANDS robots.☆11Feb 13, 2020Updated 6 years ago
- A toolkit for working with RDDL domains in Python3.☆17Nov 7, 2020Updated 5 years ago
- Convergent Policy Optimization for Safe Reinforcement Learning☆11Oct 26, 2019Updated 6 years ago
- Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.☆463Apr 2, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆38Jan 16, 2023Updated 3 years ago
- Constrained Exploration and Recovery from Experience Shaping☆22Apr 18, 2019Updated 7 years ago
- ☆12Mar 14, 2024Updated 2 years ago
- CC-POMCP, "Monte-Carlo Tree Search for Constrained POMDPs (NIPS 2018)"☆27Sep 30, 2018Updated 7 years ago
- Implementation of a highway merging scenario☆30Aug 29, 2020Updated 5 years ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆14Aug 25, 2023Updated 2 years ago
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 3 years ago
- ☆25Jan 2, 2023Updated 3 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆65Mar 24, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for the NeurIPS 2021 paper "Safe Reinforcement Learning by Imagining the Near Future"☆51Apr 8, 2022Updated 4 years ago
- ☆17Dec 12, 2020Updated 5 years ago
- RAN场景建模☆11Dec 3, 2019Updated 6 years ago
- Guarantee_Learning_Control☆11Sep 5, 2019Updated 6 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- ☆11Nov 18, 2023Updated 2 years ago
- [PLDI 19'] An Inductive Synthesis Framework for Verifiable Reinforcement Learning☆14Jan 14, 2020Updated 6 years ago
- ☆78Jun 2, 2024Updated last year
- Deep Q-Network (DQN) and DDPG to address the problem of stall around the wing sail of an autonomous sailing robot☆11Sep 18, 2018Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Automated Continuous Data Quality Measurement☆12Nov 15, 2023Updated 2 years ago
- Code repo for "Collapsing Bandits and Their Applications to Public Health Interventions", (NeurIPS'20)☆10Dec 3, 2025Updated 5 months ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- A (mixed integer) linear optimisation model for local energy systems☆13May 27, 2021Updated 4 years ago
- A PyTorch implementation of "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆44Mar 1, 2023Updated 3 years ago
- Non-stationary Off-policy Evaluation☆13Nov 8, 2018Updated 7 years ago
- A Framework for Safe and Accelerated Reinforcement Learning-based Radio Resource Management☆21Oct 1, 2022Updated 3 years ago