Jingliang-Duan / DSAC-v1
DSAC; Distributional Soft Actor-Critic
☆132 Updated 7 months ago
Alternatives and similar repositories for DSAC-v1
Users that are interested in DSAC-v1 are comparing it to the libraries listed below
- A fast safe reinforcement learning library in PyTorch ☆216 Updated last year
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆192 Updated last year
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆112 Updated 4 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022) ☆80 Updated 2 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning" ☆61 Updated 2 years ago
- PyTorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.] ☆73 Updated 6 years ago
- ☆76 Updated last year
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm. ☆53 Updated 4 years ago
- Implementation of PPO Lagrangian in PyTorch ☆50 Updated 3 years ago
- ☆217 Updated 2 years ago
- ☆102 Updated 3 years ago
- PyTorch implementation of Constrained Policy Optimization ☆55 Updated 3 years ago
- Transformer in RL for decision-making ☆100 Updated 2 years ago
- Distributional Soft Actor Critic ☆58 Updated 5 years ago
- ☆43 Updated 3 years ago
- ☆41 Updated 3 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL… ☆292 Updated 4 years ago
- This is the official implementation of Multi-Agent PPO. ☆117 Updated 2 years ago
- ☆54 Updated 4 months ago
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..… ☆142 Updated last year
- Deep recurrent Q learning on CartPole-v1 environment ☆93 Updated last year
- A clean and robust PyTorch implementation of PPO on continuous action space. ☆163 Updated last year
- PyTorch implementation of GAIL and AIRL based on PPO. ☆226 Updated 4 years ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te… ☆175 Updated 2 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research. ☆66 Updated last year
- ☆39 Updated 3 years ago
- A plotter for reinforcement learning (RL) ☆232 Updated 3 years ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment ☆70 Updated last year
- Single-file PyTorch implementation of hybrid-SAC ☆59 Updated 4 years ago
- Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotli… ☆135 Updated 4 years ago