FuxiRL / FeverBasketball

☆55

Related projects: ⓘ

tencent-ailab / TLeague
☆79Updated 2 months ago
ChengTsang / PPO-clip-and-PPO-penalty-on-Atari-Domain
Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty
☆56Updated 5 years ago
tencent-ailab / tleague_projpage
☆135Updated 3 years ago
wsjeon / maddpg-rllib
MADDPG in Ray/RLlib
☆50Updated 4 years ago
facebookresearch / CollaQ
A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"
☆127Updated last year
TonghanWang / ROMA
Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)
☆148Updated last year
Coac / CommNet-BiCnet
CommNet and BiCnet implementation in tensorflow
☆54Updated 6 years ago
TianhongDai / distributed-ppo
This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).
☆62Updated 6 years ago
YuhangSong / Arena-Baselines
Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.
☆101Updated 3 years ago
dadadidodi / m3ddpg
☆45Updated 5 years ago
qian18long / epciclr2020
☆117Updated last year
QDPP-GitHub / QDPP
Multi-Agent Determinantal Q-Learning
☆41Updated last year
Theohhhu / UPDeT
Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotli…
☆128Updated 3 years ago
ying-wen / malib_deprecated
A Multi-agent Learning Framework
☆61Updated 3 years ago
staghuntrpg / RPG
This is the source code of RPG (Reward-Randomized Policy Gradient)
☆43Updated 2 years ago
ac-93 / soft-actor-critic
Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.
☆92Updated 4 years ago
root-master / unified-hrl
Unified Model-Free Hierarchical Reinforcement Learning Framework
☆37Updated 5 years ago
yalidu / liir
Learning Individual Intrinsic Reward in MARL
☆62Updated last year
TARTRL / TiKick
Learning-based agent for Google Research Football (足球游戏智能体)
☆106Updated last year
jidiai / Competition_Olympics-Integrated
☆24Updated 2 years ago
AnujMahajanOxf / MAVEN
Submission for MAVEN: Multi-Agent Variational Exploration
☆57Updated 2 years ago
lich14 / CDS
[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.
☆83Updated last year
jidiai / Competition_Olympics-Running
☆15Updated 2 years ago
TonghanWang / RODE
Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …
☆68Updated 10 months ago
shariqiqbal2810 / multiagent-particle-envs
☆45Updated 4 years ago
wizdom13 / RND-Pytorch
Random Network Distillation(RND) algo in Pytorch
☆48Updated 5 years ago
LilTwo / DRL-using-PyTorch
PyTorch implementation of Deep Reinforcement Algorithm
☆30Updated 2 years ago
yexm-ze / maddpg-mpe
Transplant a implementation of MADDPG to the environment provided by openAI (multiagent-particle-envs).
☆18Updated 3 years ago
FuxiRL / DunkCityDynasty
☆68Updated 7 months ago
KornbergFresnel / CommNet
an implementation of CommNet
☆29Updated 6 years ago