PKU-Alignment / Safe-Policy-Optimization
NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms
☆321Updated 5 months ago
Related projects: ⓘ
- NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark☆377Updated 4 months ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆161Updated last year
- The repository is for safe reinforcement learning baselines.☆467Updated 2 weeks ago
- Official implementation of HARL algorithms based on PyTorch.☆455Updated 6 months ago
- 🚀 A fast safe reinforcement learning library in PyTorch☆150Updated this week
- PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.☆387Updated last year
- ☆323Updated 8 months ago
- A plotter for reinforcement learning (RL)☆205Updated 2 years ago
- DSAC-v2; DASC; Distributional Soft Actor-Critic☆210Updated 5 months ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆139Updated 5 months ago
- ☆180Updated last year
- An elegant PyTorch offline reinforcement learning library for researchers.☆260Updated 5 months ago
- An extension of the PyMARL codebase that includes additional algorithms and environment support☆467Updated last month
- Multi-Agent Reinforcement Learning (MARL) papers☆196Updated 2 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆161Updated this week
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆325Updated last year
- ☆197Updated 7 months ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆311Updated 2 years ago
- DSAC; Distributional Soft Actor-Critic☆108Updated 6 months ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆115Updated 3 months ago
- This is the official implementation of Multi-Agent PPO.☆89Updated last year
- Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"☆185Updated 5 years ago
- Code for conservative Q-learning☆393Updated 2 years ago
- A collection of offline reinforcement learning algorithms.☆153Updated 3 months ago
- PyTorch implementation of GAIL and AIRL based on PPO.☆182Updated 3 years ago
- Multi-Objective Reinforcement Learning algorithms implementations.☆277Updated last week
- A Collection of Multi-Agent Reinforcement Learning (MARL) Resources☆195Updated last year
- PyTorch implementation of SAC-Discrete.☆273Updated last month
- ☆159Updated 11 months ago
- PPO, DDPG, SAC implementation on mujoco environment☆85Updated 2 years ago