RodkinIvan / Transformer-RL
Transformers (GTrXL & CoBERL) applied to RL tasks
☆26Updated 2 years ago
Related projects: ⓘ
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆83Updated last year
- ☆38Updated 2 years ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆38Updated this week
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆87Updated 3 years ago
- Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆68Updated 10 months ago
- Author's PyTorch implementation of TD7 for online and offline RL☆108Updated last year
- Transformer in RL for decision-making☆71Updated last year
- ☆87Updated 2 years ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆61Updated 3 years ago
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment☆109Updated last year
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆161Updated last year
- ☆87Updated 3 years ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆56Updated this week
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆139Updated 5 months ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆130Updated 2 months ago
- ☆180Updated last year
- This is the official implementation of Multi-Agent PPO.☆89Updated last year
- The official code releasement of publications in MARL field of TJU RL lab.☆50Updated 2 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆113Updated last month
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆41Updated 2 years ago
- There will be updates later☆79Updated 5 years ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆123Updated 8 months ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆70Updated 9 months ago
- ☆16Updated 2 years ago
- Code for Weighted QMIX☆119Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆44Updated 3 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆52Updated 4 years ago
- ☆19Updated 5 months ago
- ☆96Updated 2 months ago
- Google Research Football MARL Benchmark and Research Toolkit☆28Updated 4 months ago