williamyuanv0 / Transformer-in-Reinforcement-Learning-for-Decision-Making-A-Survey
Transformer in RL for decision-making
☆94Updated 2 years ago
Alternatives and similar repositories for Transformer-in-Reinforcement-Learning-for-Decision-Making-A-Survey:
Users who are interested in Transformer-in-Reinforcement-Learning-for-Decision-Making-A-Survey are comparing it to the repositories listed below
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆163Updated 11 months ago
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO).☆82Updated last year
- DSAC; Distributional Soft Actor-Critic☆125Updated last month
- This is the official implementation of Multi-Agent PPO.☆105Updated 2 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆160Updated 8 months ago
- ☆197Updated last year
- ☆42Updated 3 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆70Updated last year
- A collection of recent MARL papers☆86Updated 3 months ago
- A clean and robust PyTorch implementation of SAC for continuous action spaces☆68Updated 9 months ago
- Implementation of PPO Lagrangian in PyTorch☆38Updated 2 years ago
- The official code release for publications in the MARL field from the TJU RL lab.☆70Updated 2 years ago
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆120Updated 11 months ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆55Updated last year
- 🚀 A fast safe reinforcement learning library in PyTorch☆177Updated 5 months ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆56Updated 9 months ago
- PyTorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.]☆69Updated 5 years ago
- ☆29Updated 11 months ago
- ☆95Updated 3 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆60Updated last year
- ☆58Updated last month
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆43Updated 6 months ago
- ☆108Updated last year
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆152Updated last year
- Author's PyTorch implementation of TD7 for online and offline RL☆137Updated last year
- A collection of offline reinforcement learning algorithms.☆174Updated 3 months ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆101Updated 3 years ago
- There will be updates later☆84Updated 5 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆195Updated 6 months ago
- ☆102Updated last month