NeteaseFuxiRL / wuji
original source code of the ASE 2019 paper: Wuji: Automatic Online Combat Game Testing Using Evolutionary Deep Reinforcement Learning
☆26Updated 4 years ago
Alternatives and similar repositories for wuji:
Users that are interested in wuji are comparing it to the libraries listed below
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆44Updated 4 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 5 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆46Updated 6 years ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆29Updated 3 years ago
- Distributed Deep Reinforcement Learning☆29Updated 4 years ago
- ☆18Updated 3 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated 2 years ago
- A Multi-agent Learning Framework☆62Updated 3 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Updated 2 years ago
- My internship project in 𝖢𝖠𝖲𝖨𝖠. 🤗☆3Updated 5 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆47Updated 5 months ago
- Environments with IC3Net paper☆12Updated 6 years ago
- Random Network Distillation(RND) algo in Pytorch☆48Updated 5 years ago
- ☆18Updated 5 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆44Updated 2 years ago
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆38Updated 6 years ago
- Experiments to train transformer network to master reinforcement learning environments.☆33Updated 3 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 4 years ago
- ☆18Updated 5 years ago
- ☆42Updated last month
- Multi-Objective Deep Reinforcement Learning☆42Updated 8 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆101Updated 3 weeks ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆62Updated 6 years ago
- Multi-Agent Determinantal Q-Learning☆42Updated 2 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆26Updated 2 years ago
- DecentralizedLearning☆22Updated 2 years ago
- ☆44Updated 2 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Updated 2 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆128Updated last year
- Learning Individual Intrinsic Reward in MARL☆63Updated 2 years ago