liuanji / WU-UCTLinks

A novel parallel UCT algorithm with linear speedup and negligible performance loss.

☆119

Alternatives and similar repositories for WU-UCT

Users that are interested in WU-UCT are comparing it to the libraries listed below

Sorting:

ray-project / rl-experiments
Keeping track of RL experiments
☆162Updated 2 years ago
lns / dapo
Source code for the paper "Divergence-Augmented Policy Optimization"
☆37Updated 5 years ago
facebookresearch / CollaQ
A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"
☆130Updated last year
YuhangSong / Arena-Baselines
Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.
☆103Updated 4 months ago
MadryLab / implementation-matters
☆132Updated 11 months ago
tencent-ailab / tleague_projpage
☆144Updated 7 months ago
facebookresearch / hanabi_SAD
Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning
☆101Updated 3 years ago
implementation-matters / code-for-paper
☆111Updated 5 years ago
ying-wen / malib_deprecated
A Multi-agent Learning Framework
☆62Updated 4 years ago
harvard-edge / QuaRL
QuaRL is an open-source framework for systematically studying the effect of applying quantization to reinforcement learning algorithms.
☆72Updated 2 years ago
staghuntrpg / RPG
This is the source code of RPG (Reward-Randomized Policy Gradient)
☆42Updated 2 years ago
tencent-ailab / TLeague
☆4Updated 7 months ago
openai / phasic-policy-gradient
Code for the paper "Phasic Policy Gradient"
☆262Updated 2 years ago
cyoon1729 / distributedRL
A framework for easy prototyping of distributed reinforcement learning algorithms
☆96Updated 4 years ago
createamind / DRL
☆92Updated 4 years ago
alshedivat / lola
Code release for Learning with Opponent-Learning Awareness and variations.
☆149Updated 2 years ago
JBLanier / pipeline-psro
Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
☆51Updated 10 months ago
qlan3 / Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
☆92Updated 3 weeks ago
jparkerholder / DvD_ES
Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…
☆44Updated 4 years ago
apourchot / CEM-RL
Combining Evolutionary Algorithms and deep RL in various ways
☆102Updated 4 years ago
neka-nat / distributed_rl
Pytorch implementation of distributed deep reinforcement learning
☆76Updated 3 years ago
crisbodnar / pderl
Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020
☆52Updated 11 months ago
HumanCompatibleAI / human_aware_rl
Code for "On the Utility of Learning about Humans for Human-AI Coordination"
☆108Updated 2 years ago
PKU-RL / Literature
☆106Updated 4 years ago
nathangrinsztajn / Box-World
Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning"
☆46Updated last year
microsoft / oac-explore
Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)
☆70Updated last year
rmst / rtrl
PyTorch implementation of our paper Real-Time Reinforcement Learning (NeurIPS 2019)
☆73Updated 5 years ago
pokaxpoka / sunrise
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
☆125Updated 4 years ago
younggyoseo / pytorch-nfsp
Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)
☆45Updated 6 years ago
indylab / nxdo
Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games
☆39Updated 3 years ago