liuanji / WU-UCT
A novel parallel UCT algorithm with linear speedup and negligible performance loss.
☆118 Updated 4 years ago
Alternatives and similar repositories for WU-UCT
Users who are interested in WU-UCT are comparing it to the libraries listed below.
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function" ☆131 Updated last year
- Keeping track of RL experiments ☆161 Updated 2 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games ☆51 Updated 9 months ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning ☆101 Updated 2 years ago
- ☆143 Updated 5 months ago
- This is the source code of RPG (Reward-Randomized Policy Gradient) ☆42 Updated 2 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020. ☆103 Updated 2 months ago
- Source code for the paper "Divergence-Augmented Policy Optimization" ☆37 Updated 5 years ago
- ☆129 Updated 10 months ago
- Combining Evolutionary Algorithms and deep RL in various ways ☆102 Updated 4 years ago
- A Multi-agent Learning Framework ☆62 Updated 4 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t… ☆44 Updated 4 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆168 Updated 3 years ago
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas. ☆92 Updated last month
- Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020 ☆52 Updated 10 months ago
- PyTorch RL for Pommerman ☆38 Updated 6 years ago
- Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039) ☆160 Updated 2 years ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination" ☆108 Updated 2 years ago
- [ICLR 2021] Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments. ☆58 Updated 2 years ago
- ☆120 Updated 2 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016) ☆45 Updated 6 years ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations" ☆161 Updated 3 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆86 Updated 3 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe… ☆158 Updated 4 years ago
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning ☆125 Updated 4 years ago
- ☆4 Updated 5 months ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games ☆39 Updated 3 years ago
- Multi-Agent Determinantal Q-Learning ☆42 Updated 2 years ago
- Soft Actor-Critic ☆147 Updated 7 years ago
- QuaRL is an open-source framework for systematically studying the effect of applying quantization to reinforcement learning algorithms. ☆68 Updated 2 years ago