brilee / python_uct
Demo of UCT (MCTS) in Python / Numpy
☆88Updated 2 years ago
Alternatives and similar repositories for python_uct
Users that are interested in python_uct are comparing it to the libraries listed below
- ☆67Updated 3 years ago
- An implementation of Monte Carlo Tree Search in python☆162Updated 4 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆149Updated 2 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Updated 11 months ago
- Board game AI implementations using Monte Carlo Tree Search☆184Updated 5 years ago
- Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"☆37Updated 6 years ago
- C51-DDQN in Keras☆126Updated 7 years ago
- A python implementation of tabular MuZero for educational purposes☆21Updated 5 years ago
- Atari - Deep Reinforcement Learning algorithms in TensorFlow☆138Updated last year
- A framework for easy prototyping of distributed reinforcement learning algorithms☆96Updated 4 years ago
- Bandits Environments for the OpenAI Gym☆89Updated 5 years ago
- A reinforcement learning framework☆155Updated 6 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆120Updated 4 years ago
- ☆106Updated 5 years ago
- This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"☆190Updated 6 years ago
- A customizable framework to create maze and gridworld environments☆268Updated 6 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Updated 6 years ago
- Clone of OpenAI's Spinning Up in PyTorch☆151Updated 3 years ago
- Some baselines for Pommerman competition☆46Updated 7 years ago
- ☆72Updated 2 years ago
- This package allows using PLE as a gym environment.☆72Updated 5 years ago
- A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th…☆202Updated 5 years ago
- ☆92Updated 4 years ago
- Code for the paper "Evolved Policy Gradients"☆250Updated 6 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆181Updated 6 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆203Updated 2 years ago
- Simple tools for statistical analyses in RL experiments☆67Updated 7 years ago
- Open AI gym environment for the game 2048☆73Updated 3 years ago
- Highly Modular and Scalable Reinforcement Learning☆115Updated 5 years ago
- Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability☆204Updated 4 years ago