15418-final / ParallelizedMCTSLinks

☆15

Alternatives and similar repositories for ParallelizedMCTS

Users that are interested in ParallelizedMCTS are comparing it to the libraries listed below

Sorting:

facebookresearch / rela
Reinforcement Learning Assembly
☆92Updated 3 years ago
ShangtongZhang / DistributedES
Distributed implementation of popular evolutionary methods
☆64Updated 7 years ago
AdeelMufti / RL-RND
Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation
☆31Updated 6 years ago
deepsense-ai / Distributed-BA3C
☆56Updated 2 years ago
geek-ai / 1m-agents
A platform of grid world that supports up to 1 million reinforcement-learning agents.
☆69Updated 7 years ago
google-research / policy-learning-landscape
Explore the optimization landscape for direct policy learning reinforcement learning.
☆51Updated 6 years ago
sadeqa / Super-Mario-Bros-RL
This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M…
☆79Updated 6 years ago
liuanji / WU-UCT
A novel parallel UCT algorithm with linear speedup and negligible performance loss.
☆120Updated 4 years ago
alexis-jacq / LOLA_DiCE
Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
☆95Updated 6 years ago
PatrykChrabaszcz / Canonical_ES_Atari
Benchmarking Canonical Evolution Strategies for Playing Atari
☆83Updated 7 years ago
flowersteam / geppg
☆35Updated 6 years ago
facebookresearch / slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆93Updated 5 years ago
floodsung / a2c_cartpole_pytorch
advantage actor-critic reinforcement learning for openai gym cartpole
☆65Updated 8 years ago
jingweiz / pytorch-distributed
Ape-X DQN & DDPG with pytorch & tensorboard
☆102Updated 6 years ago
senya-ashukha / quantile-regression-dqn-pytorch
A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning
☆95Updated 4 years ago
kimhc6028 / policy-gradient-importance-sampling
Policy gradient reinforcement learning algorithm with importance sampling
☆32Updated 7 years ago
idlrl / flare
RL framework for embodied agents based on PyTorch
☆12Updated 6 years ago
quanvuong / Supervised_Policy_Update
Code to reproduce Supervised Policy Update (ICLR 2019)
☆17Updated 2 years ago
nosyndicate / pytorchrl
Deep Reinforcement Learning algorithms implemented in PyTorch
☆49Updated 7 years ago
paintception / Deep-Quality-Value-DQV-Learning-
DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm
☆25Updated 2 years ago
gd-zhang / ACKTR
Actor Critic using Kronecker-Factored Trust Region
☆19Updated 7 years ago
illidanlab / rpg
Ranking Policy Gradient
☆23Updated 5 years ago
lansiz / eqpt
The Path to Nash Equilibrium
☆38Updated 2 years ago
AnujMahajanOxf / VIREL
Code for VIREL: A Variational Inference Framework for Reinforcement Learning
☆14Updated 5 years ago
xlnwel / model-free-algorithms
TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x
☆62Updated 4 years ago
wulfebw / hierarchical_rl
hierarchical deep reinforcement learning algorithms
☆41Updated 7 years ago
wassname / world-models-sonic-pytorch
Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w…
☆32Updated 6 years ago
LiubovSobolevskaya / tf2-a2c-ppo
mplementation of Advantage Actor Critic (A2C) and Proximal Policy Optimization Algorithm (PPO) use the advantages of Tensorflow 2.x.
☆9Updated 5 years ago
petosa / multiplayer-alphazero
PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop]
☆34Updated 4 years ago
ZhengyaoJiang / NLRL
Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729)
☆76Updated 5 years ago