dityas / Protos
Factored Interactive POMDP solver based on symbolic Perseus.
☆10Updated last year
Alternatives and similar repositories for Protos:
Users that are interested in Protos are comparing it to the libraries listed below
- ☆40Updated 2 years ago
- Code to train RL agents along with Adversarial distrubance agents☆63Updated 7 years ago
- Logically-Constrained Reinforcement Learning☆53Updated 6 months ago
- Hierarchical Online Planning and Reinforcement Learning on Taxi☆30Updated 7 years ago
- Learning algorithm implementation and experiments in the paper "A Composable Specification Language for Reinforcement Learning Tasks" (ht…☆16Updated 4 years ago
- Gym-like extensions for POMDP☆57Updated 3 years ago
- Code exploring the use of reward machines in the context of cooperative multi-agent reinforcement learning.☆13Updated last year
- Code for the paper "AlwaysSafe: Reinforcement Learning Without Safety Constraint Violations During Training"☆17Updated 2 years ago
- Robust Reinforcement Learning with the Alternating Training of Learned Adversaries (ATLA) framework☆60Updated 3 years ago
- ☆26Updated 4 years ago
- An implementation of Constrained Policy Optimization (Achiam 2017) in PyTorch☆23Updated 4 years ago
- Reinforcement Learning framework for Temporal Goals☆11Updated last year
- Gridworld for MARL experiments☆138Updated 3 years ago
- ☆17Updated last year
- [PLDI 19'] An Inductive Synthesis Framework for Verifiable Reinforcement Learning☆13Updated 5 years ago
- Solving POMDP using Recurrent networks☆85Updated 4 years ago
- The Verifiably Safe Reinforcement Learning Framework☆56Updated 3 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆31Updated 2 years ago
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning☆32Updated 5 years ago
- PyTorch implementation of our paper Real-Time Reinforcement Learning (NeurIPS 2019)☆73Updated 4 years ago
- ☆12Updated 4 years ago
- ☆71Updated 7 months ago
- an implementation of ATOC☆14Updated 3 years ago
- Cyber Operations Research Gym☆67Updated 7 months ago
- ☆16Updated last year
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Updated 6 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆48Updated 2 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Updated 5 years ago
- A practical step-by-step guide to applying RUDDER☆34Updated 5 years ago
- Prioritized Sequence Experience Replay☆10Updated 3 years ago