IBM / vsrl-framework
The Verifiably Safe Reinforcement Learning Framework
☆56Updated 3 years ago
Alternatives and similar repositories for vsrl-framework:
Users that are interested in vsrl-framework are comparing it to the libraries listed below
- Reinforcement Learning framework for Temporal Goals☆11Updated 2 years ago
- Logically-Constrained Reinforcement Learning☆54Updated 8 months ago
- ☆41Updated 2 years ago
- ☆66Updated last year
- Learning algorithm implementation and experiments in the paper "A Composable Specification Language for Reinforcement Learning Tasks" (ht…☆16Updated 4 years ago
- Reachability Analysis Tool of Neural Network Controlled Systems (NNCSs)☆17Updated 2 years ago
- DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning☆21Updated 3 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Updated 6 years ago
- ☆18Updated last year
- Neurosymbolic transformers for multi-agent communication.☆22Updated 4 years ago
- ☆24Updated 2 years ago
- Code for experiments in the paper: "Compositional Reinforcement Learning from Logical Specifications" (https://arxiv.org/abs/2106.13906).☆15Updated 3 years ago
- Implementation of PDFAs and PDFA learning algorithm.☆11Updated 4 years ago
- From LTLf / PPLTL to Deterministic Finite-state Automata (DFA)☆70Updated last year
- Safe exploration in Markov Decision Processes☆37Updated 7 years ago
- Code that translates grammar into PDDL, runs a planner to produce multiple plans, translates plans into trainable lale pipelines and trai…☆18Updated last year
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆66Updated 2 years ago
- Safe Reinforcement Learning algorithms☆74Updated 2 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆49Updated 3 years ago
- Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks☆49Updated 2 years ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆34Updated 2 years ago
- Implementation of (Learning Continuous Control Policies by Stochastic Value Gradients)[https://arxiv.org/abs/1510.09142]☆26Updated 3 years ago
- SmalL bUt Complete GROne Synthesizer☆38Updated 7 months ago
- Synthesizer of LTLf formula☆10Updated last month
- On the model-based stochastic value gradient for continuous reinforcement learning☆55Updated last year
- A web based platform for collecting human actions in reinforcement learning environments☆28Updated last year
- ☆30Updated last year
- Code for the paper Model-Predictive Control via Cross-Entropy and Gradient-Based Optimization☆69Updated 4 years ago
- ☆18Updated 4 years ago
- Bridging State and History Representations: Understanding Self-Predictive RL -- ICLR 2024☆18Updated 11 months ago