kevinzhangftw / Monte-Carlo-Tree-Search-Tic-Tac-Toe
MCTS Implementation in Python
☆9Updated 8 years ago
Alternatives and similar repositories for Monte-Carlo-Tree-Search-Tic-Tac-Toe:
Users that are interested in Monte-Carlo-Tree-Search-Tic-Tac-Toe are comparing it to the libraries listed below
- Implementation is mostly based on Sergey Levine work (http://www.eecs.berkeley.edu/~svlevine/).☆43Updated 10 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Updated 7 months ago
- Using Pilco algorithm to find a controller for few robotic problems☆43Updated 9 years ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- Neuronal Circuit Policies☆40Updated 2 years ago
- A platform of grid world that supports up to 1 million reinforcement-learning agents.☆69Updated 7 years ago
- Fictitious Self-play & Reinforcement Learning☆18Updated 7 years ago
- BBRL is a C++ open-source library used to compare Bayesian reinforcement learning algorithms☆34Updated 9 years ago
- Julia Implementation of the POMCP algorithm for solving POMDPs☆12Updated 3 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Updated 5 years ago
- TensorFlow impementation of: Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images☆65Updated 8 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 6 years ago
- some common TD Learning algorithms☆67Updated 5 years ago
- ☆35Updated 6 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Updated 5 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆27Updated 6 years ago
- Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation☆87Updated 7 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Updated 7 years ago
- Model-Based Generative Adversarial Imitation Learning☆89Updated 4 years ago
- A library of probabilistic model based RL algorithms in pytorch☆107Updated 4 years ago
- Robust policy search algorithms which train on model ensembles☆28Updated 8 years ago
- reimplementation of the ddpg algorithm using tensorflow☆38Updated 8 years ago
- ☆44Updated 6 years ago
- Simple tools for statistical analyses in RL experiments☆66Updated 6 years ago
- Reinforcement learning benchmarking.☆40Updated 6 years ago
- Our NIPS 2017: Learning to Run source code☆55Updated 2 years ago
- ☆54Updated 7 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Updated 2 years ago
- ☆19Updated 9 years ago
- Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".☆35Updated 6 years ago