FuxiRL / FeverBasketball
☆55Updated this week
Related projects:
- ☆79Updated 2 months ago
- Implementation of PPO-clip and PPO-penalty on Atari; described as the only open-source implementation of PPO-penalty☆56Updated 5 years ago
- ☆135Updated 3 years ago
- MADDPG in Ray/RLlib☆50Updated 4 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆127Updated last year
- Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)☆148Updated last year
- CommNet and BiCNet implementation in TensorFlow☆54Updated 6 years ago
- This is a PyTorch implementation of Distributed Proximal Policy Optimization (DPPO).☆62Updated 6 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆101Updated 3 years ago
- ☆45Updated 5 years ago
- ☆117Updated last year
- Multi-Agent Determinantal Q-Learning☆41Updated last year
- Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotli…☆128Updated 3 years ago
- A Multi-agent Learning Framework☆61Updated 3 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆43Updated 2 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆92Updated 4 years ago
- Unified Model-Free Hierarchical Reinforcement Learning Framework☆37Updated 5 years ago
- Learning Individual Intrinsic Reward in MARL☆62Updated last year
- Learning-based agent for Google Research Football (football game agent)☆106Updated last year
- ☆24Updated 2 years ago
- Submission for MAVEN: Multi-Agent Variational Exploration☆57Updated 2 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆83Updated last year
- ☆15Updated 2 years ago
- Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks" (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆68Updated 10 months ago
- ☆45Updated 4 years ago
- Random Network Distillation (RND) algorithm in PyTorch☆48Updated 5 years ago
- PyTorch implementation of Deep Reinforcement Algorithm☆30Updated 2 years ago
- Transplant an implementation of MADDPG to the environment provided by OpenAI (multiagent-particle-envs).☆18Updated 3 years ago
- ☆68Updated 7 months ago
- An implementation of CommNet☆29Updated 6 years ago