SapanaChaudhary / PyTorch-CPO
PyTorch implementation of Constrained Policy Optimization
☆55 Updated 3 years ago
Alternatives and similar repositories for PyTorch-CPO
Users who are interested in PyTorch-CPO are comparing it to the libraries listed below.
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆182 Updated last year
- Implementation of PPO Lagrangian in PyTorch ☆50 Updated 2 years ago
- Constrained Policy Optimization implementation on Safety Gym ☆28 Updated 3 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning" ☆58 Updated 2 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm. ☆53 Updated 3 years ago
- PyTorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.] ☆72 Updated 6 years ago
- DSAC; Distributional Soft Actor-Critic ☆129 Updated 6 months ago
- ☆75 Updated last year
- PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm ☆52 Updated 3 years ago
- This is the official implementation of Multi-Agent PPO. ☆115 Updated 2 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022) ☆80 Updated 2 years ago
- ☆215 Updated 2 years ago
- Deep recurrent Q learning on CartPole-v1 environment ☆91 Updated last year
- ☆40 Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER) ☆51 Updated 6 months ago
- 🚀 A fast safe reinforcement learning library in PyTorch ☆209 Updated 10 months ago
- Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces" ☆219 Updated 6 years ago
- Paper list for constrained policy optimization in reinforcement learning. ☆73 Updated last year
- Code for Weighted QMIX ☆138 Updated 4 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆111 Updated 4 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients." ☆59 Updated 5 years ago
- ☆102 Updated 3 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery ☆104 Updated 3 years ago
- The official code release for publications in the MARL field from the TJU RL lab. ☆80 Updated 3 years ago
- A clean and robust PyTorch implementation of PPO on continuous action space. ☆161 Updated last year
- ☆41 Updated 3 years ago
- A plotter for reinforcement learning (RL) ☆229 Updated 3 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC) ☆68 Updated 2 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning ☆165 Updated last year
- ☆53 Updated 6 years ago