RodkinIvan / Transformer-RL
Transformers (GTrXL & CoBERL) applied to RL tasks
☆28Updated 2 years ago
Related projects
Alternatives and complementary repositories for Transformer-RL
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆148Updated 4 months ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆54Updated last year
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆110Updated 7 months ago
- ☆40Updated 2 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆84Updated last year
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆54Updated 10 months ago
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment☆111Updated last year
- The official code release for publications in the MARL field from the TJU RL lab.☆57Updated 2 years ago
- ☆186Updated last year
- This is the official implementation of Multi-Agent PPO.☆93Updated last year
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆129Updated 10 months ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆146Updated 7 months ago
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆170Updated last year
- ☆25Updated 7 months ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆95Updated 3 years ago
- Transformer in RL for decision-making☆75Updated last year
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO).☆73Updated 11 months ago
- A Reinforcement Learning Project using PPO + LSTM☆45Updated last year
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆52Updated 4 years ago
- ☆88Updated 4 years ago
- ☆90Updated 3 years ago
- Code accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks" (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆69Updated last year
- ☆216Updated 9 months ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆44Updated 3 years ago
- DSAC; Distributional Soft Actor-Critic☆114Updated 9 months ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆152Updated 5 months ago
- A collection of recent MARL papers☆75Updated this week
- Implementations of many sparse-reward algorithms in the Gym Fetch environment☆82Updated 4 years ago
- Official implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers', ICLR 2021 (spotli…☆130Updated 3 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆48Updated last year