kevinzhangftw / Monte-Carlo-Tree-Search-Tic-Tac-Toe

MCTS Implementation in Python

☆9

Alternatives and similar repositories for Monte-Carlo-Tree-Search-Tic-Tac-Toe:

Users that are interested in Monte-Carlo-Tree-Search-Tic-Tac-Toe are comparing it to the libraries listed below

siemanko / guided-policy-search
Implementation is mostly based on Sergey Levine work (http://www.eecs.berkeley.edu/~svlevine/).
☆43Updated 10 years ago
Officium / RL-Experiments
High-quality implementations of deep reinforcement learning algorithms for experiments
☆51Updated 7 months ago
marcino239 / pilco
Using Pilco algorithm to find a controller for few robotic problems
☆43Updated 9 years ago
facebookresearch / rela
Reinforcement Learning Assembly
☆92Updated 3 years ago
mlech26l / ordinary_neural_circuits
Neuronal Circuit Policies
☆40Updated 2 years ago
geek-ai / 1m-agents
A platform of grid world that supports up to 1 million reinforcement-learning agents.
☆69Updated 7 years ago
ShibiHe / Poker-Fictitious-Play
Fictitious Self-play & Reinforcement Learning
☆18Updated 7 years ago
mcastron / BBRL
BBRL is a C++ open-source library used to compare Bayesian reinforcement learning algorithms
☆34Updated 9 years ago
JuliaPOMDP / POMCP.jl
Julia Implementation of the POMCP algorithm for solving POMDPs
☆12Updated 3 years ago
wulfebw / muzero
A python implemenation of tabular MuZero for educational purposes
☆21Updated 5 years ago
ericjang / e2c
TensorFlow impementation of: Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images
☆65Updated 8 years ago
google-research / policy-learning-landscape
Explore the optimization landscape for direct policy learning reinforcement learning.
☆50Updated 6 years ago
chrodan / tdlearn
some common TD Learning algorithms
☆67Updated 5 years ago
flowersteam / geppg
☆35Updated 6 years ago
facebookresearch / slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆93Updated 5 years ago
jvmncs / ParamNoise
A comparison of parameter space noise methods for exploration in deep reinforcement learning
☆27Updated 6 years ago
mrkulk / hierarchical-deep-RL
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation
☆87Updated 7 years ago
DartML / PPO-Stein-Control-Variate
Proximal Policy Optimization with Stein Control Variates:
☆33Updated 7 years ago
itaicaspi / mgail
Model-Based Generative Adversarial Imitation Learning
☆89Updated 4 years ago
mcgillmrl / prob_mbrl
A library of probabilistic model based RL algorithms in pytorch
☆107Updated 4 years ago
aravindr93 / robustRL
Robust policy search algorithms which train on model ensembles
☆28Updated 8 years ago
MOCR / DDPG
reimplementation of the ddpg algorithm using tensorflow
☆38Updated 8 years ago
Feryal / craft-env
☆44Updated 6 years ago
flowersteam / rl-difference-testing
Simple tools for statistical analyses in RL experiments
☆66Updated 6 years ago
krfricke / rl-benchmark
Reinforcement learning benchmarking.
☆40Updated 6 years ago
AdamStelmaszczyk / learning2run
Our NIPS 2017: Learning to Run source code
☆55Updated 2 years ago
zuoxingdong / DeepPILCO
☆54Updated 7 years ago
quanvuong / Supervised_Policy_Update
Code to reproduce Supervised Policy Update (ICLR 2019)
☆17Updated 2 years ago
ilyasu123 / trpo
☆19Updated 9 years ago
tmoer / multimodal_varinf
Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".
☆35Updated 6 years ago