tencent-ailab / TLeague
☆4 Updated 4 months ago
Alternatives and similar repositories for TLeague:
Users who are interested in TLeague are comparing it to the libraries listed below.
- ☆142 Updated 4 months ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function" ☆131 Updated last year
- RLA is a tool for managing your RL experiments automatically ☆71 Updated 2 years ago
- Code accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020, https://arxiv.org/abs/2003.08039) ☆157 Updated 2 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient) ☆43 Updated 2 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020. ☆103 Updated last month
- Keeping track of RL experiments ☆162 Updated 2 years ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning ☆100 Updated 2 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization" ☆37 Updated 5 years ago
- Submission for MAVEN: Multi-Agent Variational Exploration ☆57 Updated 3 years ago
- Implements PPO-clip and PPO-penalty on Atari; billed as the only open-source implementation of PPO-penalty ☆56 Updated 6 years ago
- PyTorch implementation of distributed deep reinforcement learning ☆76 Updated 2 years ago
- A platform for intelligent agent learning based on a 3D open-world FPS game developed by Inspir.AI. ☆57 Updated 2 years ago
- Code accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322) ☆52 Updated 2 years ago
- ☆194 Updated 2 years ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations" ☆160 Updated 3 years ago
- ☆127 Updated 8 months ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016) ☆46 Updated 6 years ago
- ☆74 Updated 10 months ago
- Learning Individual Intrinsic Reward in MARL ☆62 Updated 2 years ago
- Code for MOPO: Model-based Offline Policy Optimization ☆177 Updated 2 years ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction ☆147 Updated 3 years ago
- Code accompanying the paper "Influence-Based Multi-Agent Exploration" (ICLR 2020 spotlight) ☆30 Updated 5 years ago
- ☆53 Updated last year
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games ☆50 Updated 7 months ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity. ☆85 Updated 2 years ago
- ☆25 Updated 2 years ago
- PyTorch IMPALA implementation ☆26 Updated 5 years ago
- There will be updates later ☆84 Updated 5 years ago
- ☆120 Updated 2 years ago