dobro12 / CPOLinks
Constrained Policy Optimization implementation on Safety Gym
☆27Updated 3 years ago
Alternatives and similar repositories for CPO
Users that are interested in CPO are comparing it to the libraries listed below
Sorting:
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆55Updated last year
- PyTorch implementation of Constrained Policy Optimization☆54Updated 3 years ago
- ☆74Updated last year
- Implementation of PPO Lagrangian in PyTorch☆45Updated 2 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆76Updated 2 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆62Updated 11 months ago
- DSAC; Distributional Soft Actor-Critic☆126Updated 3 months ago
- Robust and safe deep reinforcement learning algorithms☆14Updated last year
- Implementations of safe reinforcement learning algorithms☆27Updated last year
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆173Updated last year
- Pytorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.]☆72Updated 6 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆103Updated 2 years ago
- Pytorch GAIL VAIL AIRL VAIRL EAIRL SQIL Implementation☆65Updated 4 years ago
- ☆48Updated last month
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆110Updated 4 years ago
- ☆38Updated 3 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆56Updated 2 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆137Updated last year
- ☆40Updated 3 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆52Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆51Updated 3 months ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆168Updated 6 months ago
- Paper list for constrained policy optimization in reinforcement learning.☆72Updated last year
- 🔥 Datasets and env wrappers for offline safe reinforcement learning☆92Updated 8 months ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆87Updated last year
- Pytorch implementation of Multi-Agent Generative Adversarial Imitation Learning☆41Updated 3 years ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆69Updated last year
- Generate expert demonstrations; GAIL(Generative Adversarial Imitation Learning); IRL(Inverse Reinforcement Learning)☆33Updated 3 years ago
- There will be updates later☆84Updated 6 years ago
- The official code releasement of publications in MARL field of TJU RL lab.☆79Updated 2 years ago