brilee / python_uct
Demo of UCT (MCTS) in Python / Numpy
☆88Updated 2 years ago
Alternatives and similar repositories for python_uct
Users that are interested in python_uct are comparing it to the libraries listed below
- An implementation of Monte Carlo Tree Search in python☆163Updated 5 years ago
- Board game AI implementations using Monte Carlo Tree Search☆184Updated 5 years ago
- Library for running a Monte Carlo tree search, either traditionally or with expert policies☆126Updated last year
- ☆71Updated 2 years ago
- A python implementation of tabular MuZero for educational purposes☆21Updated 5 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Updated last year
- Code release for Learning with Opponent-Learning Awareness and variations.☆151Updated 2 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆122Updated 4 years ago
- RUDDER: Return Decomposition for Delayed Rewards☆48Updated 5 years ago
- Awesome RL: Papers, Books, Codes, Benchmarks☆116Updated 2 years ago
- Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"☆37Updated 6 years ago
- Clone of OpenAI's Spinning Up in PyTorch☆152Updated 3 years ago
- Bandits Environments for the OpenAI Gym☆89Updated 5 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms☆96Updated 4 years ago
- Simple tools for statistical analyses in RL experiments☆67Updated 7 years ago
- Actor-critic trained with PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch.☆101Updated 5 years ago
- ☆66Updated 3 years ago
- Reproducing results from DeepMind's paper on Population Based Training of Neural Networks.☆55Updated 7 years ago
- This package allows using PLE as a gym environment.☆72Updated 5 years ago
- PyTorch implementation of Proximal Policy Optimization☆53Updated 7 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Updated 7 months ago
- C51-DDQN in Keras☆126Updated 7 years ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆71Updated 9 years ago
- Actor-critic with experience replay☆256Updated 3 years ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Me…☆244Updated 3 years ago
- Gridworld environments for OpenAI gym.☆79Updated last year
- Fictitious Self-play & Reinforcement Learning☆18Updated 7 years ago
- Augmented environments with RL☆103Updated 6 years ago
- Highly Modular and Scalable Reinforcement Learning☆118Updated 5 years ago
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow☆103Updated 5 years ago