CyCTW / Parallel-MCTS

Parallel Monte Carlo Tree Search. See README.md for more detailed usage and information.

☆46

Alternatives and similar repositories for Parallel-MCTS

Users who are interested in Parallel-MCTS are comparing it to the libraries listed below:

liuanji / WU-UCT
A novel parallel UCT algorithm with linear speedup and negligible performance loss.
☆120 Updated 4 years ago
waterhorse1 / NAC
(NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.
☆28 Updated 3 years ago
JBLanier / pipeline-psro
Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
☆51 Updated 9 months ago
lamda-bbo / madac
Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”
☆25 Updated 2 years ago
MahanFathi / Model-Based-RL
Model-based Policy Gradients
☆31 Updated 5 years ago
quantumiracle / MARS
MARS is short for Multi-Agent Research Studio, a library for multi-agent reinforcement learning research.
☆49 Updated last year
Div99 / XQL
Extreme Q-Learning: Max Entropy RL without Entropy
☆88 Updated 2 years ago
avillaflor / SPLT-transformer
☆18 Updated 2 years ago
indylab / nxdo
Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games
☆39 Updated 3 years ago
daisatojp / mpo
PyTorch Implementation of the Maximum a Posteriori Policy Optimisation
☆76 Updated 2 years ago
cassidylaidlaw / effective-horizon
Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"
☆48 Updated last year
jparkerholder / DvD_ES
Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…
☆44 Updated 4 years ago
frt03 / generalized_dt
Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)
☆67 Updated 2 years ago
BY571 / IQN-and-Extensions
PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…
☆90 Updated 2 years ago
sfujim / LAP-PAL
Author's PyTorch implementation of LAP and PAL with TD3 and DDQN
☆35 Updated 3 years ago
ingambe / RayEnvWrapper
A wrapper that vectorizes OpenAI Gym environments with Ray
☆23 Updated 2 years ago
TonghanWang / DOP
Code accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)
☆52 Updated 2 years ago
rmst / rlrd
PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)
☆41 Updated 3 years ago
Miffyli / rl-action-space-shaping
Experiment code for testing effect of various action space transformations in reinforcement learning
☆30 Updated 5 years ago
MarcoMeter / neroRL
Deep Reinforcement Learning Framework done with PyTorch
☆36 Updated 3 months ago
ymzhang01 / focops
PyTorch implementation of First Order Constrained Optimization in Policy Space (FOCOPS).
☆28 Updated 3 years ago
alirezakazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆50 Updated last month
seolhokim / DistributedRL-Pytorch-Ray
Distributed RL implementation using PyTorch and Ray (Ape-X, A3C, Distributed PPO (DPPO), IMPALA)
☆27 Updated 3 years ago
mavischer / DRRL
A2C training of Relational Deep Reinforcement Learning Architecture
☆13 Updated 3 years ago
yycdavid / program-synthesis-guided-RL
☆24 Updated 2 years ago
Valarzz / DLPA
☆22 Updated last year
ZhengyaoJiang / latentplan
Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.
☆108 Updated 2 years ago
apexrl / CoDAIL
Implementation of CoDAIL from the ICLR 2020 paper "Multi-Agent Interactions Modeling with Correlated Policies"
☆18 Updated 4 years ago
felix-kerkhoff / DQfD
An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games
☆29 Updated 2 years ago
mit-gfx / PGMORL
[ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control
☆113 Updated 4 years ago