brilee / python_uctLinks

Demo of UCT (MCTS) in Python / Numpy

☆88

Alternatives and similar repositories for python_uct

Users that are interested in python_uct are comparing it to the libraries listed below

Sorting:

YuriCat / MuZeroJupyterExample
☆67Updated 3 years ago
hildensia / mcts
An implementation of Monte Carlo Tree Search in python
☆162Updated 4 years ago
alshedivat / lola
Code release for Learning with Opponent-Learning Awareness and variations.
☆149Updated 2 years ago
Officium / RL-Experiments
High-quality implementations of deep reinforcement learning algorithms for experiments
☆51Updated 11 months ago
jbradberry / mcts
Board game AI implementations using Monte Carlo Tree Search
☆184Updated 5 years ago
BorealisAI / pommerman-baseline
Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"
☆37Updated 6 years ago
flyyufelix / C51-DDQN-Keras
C51-DDQN in Keras
☆126Updated 7 years ago
wulfebw / muzero
A python implemenation of tabular MuZero for educational purposes
☆21Updated 5 years ago
brendanator / atari-rl
Atari - Deep Reinforcement Learning algorithms in TensorFlow
☆138Updated last year
cyoon1729 / distributedRL
A framework for easy prototyping of distributed reinforcement learning algorithms
☆96Updated 4 years ago
JKCooper2 / gym-bandits
Bandits Environments for the OpenAI Gym
☆89Updated 5 years ago
unixpickle / anyrl-py
A reinforcement learning framework
☆155Updated 6 years ago
liuanji / WU-UCT
A novel parallel UCT algorithm with linear speedup and negligible performance loss.
☆120Updated 4 years ago
rubenrtorrado / GVGAI_GYM
☆106Updated 5 years ago
uber-research / ape-x
This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"
☆190Updated 6 years ago
zuoxingdong / mazelab
A customizable framework to create maze and gridworld environments
☆268Updated 6 years ago
alexis-jacq / LOLA_DiCE
Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
☆96Updated 6 years ago
kashif / firedup
Clone of OpenAI's Spinning Up in PyTorch
☆151Updated 3 years ago
tambetm / pommerman-baselines
Some baselines for Pommerman competition
☆46Updated 7 years ago
intel / cerl
☆72Updated 2 years ago
lusob / gym-ple
This package allows to use PLE as a gym environment.
☆72Updated 5 years ago
uber-research / atari-model-zoo
A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th…
☆202Updated 5 years ago
createamind / DRL
☆92Updated 4 years ago
openai / EPG
Code for the paper "Evolved Policy Gradients"
☆250Updated 6 years ago
MG2033 / A2C
A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
☆181Updated 6 years ago
lcswillems / torch-ac
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
☆203Updated 2 years ago
flowersteam / rl-difference-testing
Simple tools for statistical analyses in RL experiments
☆67Updated 7 years ago
rgal / gym-2048
Open AI gym environment for the game 2048
☆73Updated 3 years ago
activatedgeek / torchrl
Highly Modular and Scalable Reinforcement Learning
☆115Updated 5 years ago
google-research / episodic-curiosity
Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability
☆204Updated 4 years ago