williamyuanv0 / Transformer-in-Reinforcement-Learning-for-Decision-Making-A-Survey
Transformer in RL for decision-making
☆84Updated last year
Alternatives and similar repositories for Transformer-in-Reinforcement-Learning-for-Decision-Making-A-Survey:
Users that are interested in Transformer-in-Reinforcement-Learning-for-Decision-Making-A-Survey are comparing it to the libraries listed below
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆68Updated last year
- Implementation of PPO Lagrangian in PyTorch☆35Updated 2 years ago
- The official code releasement of publications in MARL field of TJU RL lab.☆68Updated 2 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆81Updated last year
- DSAC; Distributional Soft Actor-Critic☆121Updated 11 months ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆157Updated 9 months ago
- ☆67Updated last year
- 🚀 A fast safe reinforcement learning library in PyTorch☆170Updated 4 months ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆51Updated last year
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆186Updated 4 months ago
- A collection of recent MARL papers☆82Updated 2 months ago
- Pytorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.]☆67Updated 5 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆128Updated 7 months ago
- PyTorch implementation of Constrained Policy Optimization☆50Updated 3 years ago
- The implementation of LSTM-TD3.☆73Updated last year
- Baseline implementation of recurrent PPO using truncated BPTT☆134Updated 9 months ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆53Updated 7 months ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆66Updated 7 months ago
- ☆27Updated 10 months ago
- Author's PyTorch implementation of TD7 for online and offline RL☆124Updated last year
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆97Updated 3 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆152Updated 6 months ago
- Constrained Policy Optimization implementation on Safety Gym☆23Updated 3 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆160Updated 2 years ago
- Model-Free Safe Reinforcement Learning through Neural Barrier Certificate☆32Updated 8 months ago
- This is the official implementation of Multi-Agent PPO.☆102Updated 2 years ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆64Updated 4 months ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆114Updated 9 months ago
- ☆93Updated 3 years ago
- ☆35Updated last month