jemaw / gym-safetyLinks
Simple gym environments for safety in Reinforcement Learning Research
☆18Updated last year
Alternatives and similar repositories for gym-safety
Users that are interested in gym-safety are comparing it to the libraries listed below
Sorting:
- Submission for MAVEN: Multi-Agent Variational Exploration☆59Updated 3 years ago
- Pytorch Implementation for First Order Constrained Optimization in Policy Space (FOCOPS).☆29Updated 4 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆176Updated last year
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆143Updated last year
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆71Updated 2 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆183Updated 3 years ago
- Code for a model-based version of Constrained Policy Optimization☆11Updated 4 years ago
- An implementation of Constrained Policy Optimization (Achiam 2017) in PyTorch☆26Updated 5 years ago
- Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'☆72Updated 4 years ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆38Updated 3 years ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆55Updated last year
- Code for MOPO: Model-based Offline Policy Optimization☆191Updated 3 years ago
- There will be updates later☆88Updated 6 years ago
- ☆44Updated 4 years ago
- Gridworld for MARL experiments☆144Updated 4 years ago
- ☆42Updated 4 years ago
- A reusable framework for successor features for transfer in deep reinforcement learning using keras.☆48Updated 4 years ago
- Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan …☆72Updated 2 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆41Updated 5 years ago
- ☆49Updated 4 years ago
- ☆202Updated 2 years ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards☆32Updated 3 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆109Updated 3 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆160Updated last year
- Implementations of SAILR, PDO, and CSC☆31Updated last year
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022☆32Updated 2 years ago
- Conservative Q Learning on top of SAC☆136Updated 3 years ago
- NeurIPS2022: Constrained Update Projection Approach to Safe Policy Optimization☆13Updated 2 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆112Updated 4 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆99Updated 5 years ago