PKU-Alignment / Safe-Policy-Optimization
NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms
☆357Updated last year
Alternatives and similar repositories for Safe-Policy-Optimization
Users who are interested in Safe-Policy-Optimization are comparing it to the libraries listed below.
- NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark☆452Updated 2 months ago
- The repository is for safe reinforcement learning baselines.☆637Updated 3 weeks ago
- 🚀 A fast safe reinforcement learning library in PyTorch☆186Updated 7 months ago
- An elegant PyTorch offline reinforcement learning library for researchers.☆329Updated last year
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆170Updated last year
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆201Updated 8 months ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆359Updated 3 years ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆169Updated last year
- A collection of offline reinforcement learning algorithms.☆180Updated 5 months ago
- ☆203Updated last year
- DSAC; Distributional Soft Actor-Critic☆125Updated 3 months ago
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆341Updated last month
- Official implementation of HARL algorithms based on PyTorch.☆676Updated 2 weeks ago
- A plotter for reinforcement learning (RL)☆223Updated 3 years ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's MuJoCo Gym environments.☆350Updated 2 years ago
- ☆397Updated last year
- PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.☆475Updated last year
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO☆167Updated 3 years ago
- PyTorch implementation of GAIL and AIRL based on PPO.☆217Updated 4 years ago
- PyTorch implementation of Soft Actor-Critic (SAC)☆542Updated 3 years ago
- A clean and robust PyTorch implementation of PPO on continuous action spaces.☆145Updated 11 months ago
- Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.☆425Updated 2 years ago
- Constrained Policy Optimization implementation on Safety Gym☆27Updated 3 years ago
- Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"☆209Updated 5 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆75Updated last year
- PPO, DDPG, and SAC implementations on MuJoCo environments☆109Updated 3 years ago
- Code for conservative Q-learning☆438Updated 3 years ago
- The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta…☆150Updated 10 months ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆318Updated 8 months ago
- This is the official implementation of Multi-Agent PPO.☆106Updated 2 years ago