ertsiger / gym-subgoal-automataLinks
Environments from the papers "Using Reward Machines for High-Level Task Specification and Decomposition in Reinforcement Learning" and "Induction and Exploitation of Subgoal Automata for Reinforcement Learning" using OpenAI Gym API.
☆12Updated last year
Alternatives and similar repositories for gym-subgoal-automata
Users that are interested in gym-subgoal-automata are comparing it to the libraries listed below
Sorting:
- Paper list for constrained policy optimization in reinforcement learning.☆73Updated last year
- Logically-Constrained Reinforcement Learning☆54Updated last year
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms☆156Updated 2 years ago
- Multi-Objective Reinforcement Learning☆279Updated 3 years ago
- Code exploring the use of reward machines in the context of cooperative multi-agent reinforcement learning.☆13Updated 2 years ago
- PyTorch implementation of Constrained Policy Optimization☆54Updated 3 years ago
- Code for Dynamic Weights in Multi-Objective Deep Reinforcement Learning☆96Updated 2 years ago
- A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges☆241Updated 5 months ago
- This repo is the implementation of paper ''SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning''.☆47Updated last year
- ☆76Updated 5 years ago
- Pytorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.]☆72Updated 6 years ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …☆119Updated 8 months ago
- Implementation of PPO Lagrangian in PyTorch☆49Updated 2 years ago
- [NeurIPS 2020] PyTorch implementation of "Learning Implicit Credit Assignment for Cooperative Muti-Agent Reinforcement Learning"☆60Updated last year
- ☆44Updated last week
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆58Updated 5 years ago
- Explainable Causal Reinforcement Learning with attention☆29Updated 2 years ago
- The pytorch implementation of DGN on grid world and Starcraft☆144Updated 3 years ago
- ☆42Updated 2 years ago
- A plotter for reinforcement learning (RL)☆226Updated 3 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆58Updated last year
- Reinforcement Learning Benchmarks for Traffic Signal Control (RESCO)☆145Updated last year
- Official implementation of "Graph Neural Network Reinforcement Learning for Autonomous Mobility-on-Demand☆78Updated 4 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆79Updated 2 years ago
- Code for Weighted QMIX☆139Updated 4 years ago
- ☆124Updated 3 years ago
- Value-Decomposition Networks For Cooperative Multi-Agent Learning☆23Updated 4 years ago
- code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination'(AAAI2020)☆60Updated 5 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆180Updated last year
- A repository of high-performing hierarchical reinforcement learning models and algorithms.☆316Updated 2 years ago