PKU-Alignment / Safe-Policy-OptimizationLinks
NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms
β388Updated last year
Alternatives and similar repositories for Safe-Policy-Optimization
Users that are interested in Safe-Policy-Optimization are comparing it to the libraries listed below
Sorting:
- NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmarkβ527Updated 3 weeks ago
- π A fast safe reinforcement learning library in PyTorchβ226Updated last year
- An elegant PyTorch offline reinforcement learning library for researchers.β373Updated 5 months ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (teβ¦β175Updated 2 years ago
- π€ Elegant implementations of offline safe RL algorithms in PyTorchβ225Updated last year
- The repository is for safe reinforcement learning baselines.β739Updated 2 months ago
- A collection of offline reinforcement learning algorithms.β207Updated last year
- A plotter for reinforcement learning (RL)β234Updated 4 years ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.β363Updated 2 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)β81Updated 2 years ago
- DSAC; Distributional Soft Actor-Criticβ135Updated 10 months ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).β214Updated last year
- β220Updated 2 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPOβ182Updated 3 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RLβ381Updated 4 years ago
- PyTorch implementation of GAIL and AIRL based on PPO.β234Updated 5 years ago
- PyTorch implementation of Constrained Policy Optimizationβ56Updated 4 years ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022β340Updated last year
- A simple implementation of Generative Adversarial Imitation Learning with PyTorchβ173Updated 3 years ago
- Constrained Policy Optimization implementation on Safety Gymβ28Updated 3 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)β112Updated 4 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).β93Updated 2 years ago
- Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.β452Updated 2 years ago
- This is the official implementation of Multi-Agent PPO.β129Updated 2 years ago
- β464Updated last year
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learningβ171Updated last year
- β106Updated 4 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.β176Updated last year
- A collection of recent MARL papersβ99Updated last year
- β46Updated 3 years ago