pbsinclair42 / MCTS
A simple package to allow users to run Monte Carlo Tree Search on any perfect information domain
☆237 · Updated last year
Alternatives and similar repositories for MCTS
Users who are interested in MCTS are comparing it to the libraries listed below.
- Python Implementations of Monte Carlo Tree Search ☆319 · Updated 4 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆123 · Updated 4 years ago
- Monte carlo tree search in python ☆624 · Updated 3 years ago
- Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms ☆306 · Updated 2 years ago
- xingtian is a componentized library for the development and verification of reinforcement learning algorithms ☆317 · Updated 2 years ago
- Multi-Objective Reinforcement Learning ☆296 · Updated 4 years ago
- A parallel framework for population-based multi-agent reinforcement learning. ☆546 · Updated 2 years ago
- A repository of high-performing hierarchical reinforcement learning models and algorithms. ☆331 · Updated 2 years ago
- Codebase for Evolutionary Reinforcement Learning (ERL) from the paper "Evolution-Guided Policy Gradients in Reinforcement Learning" publi… ☆247 · Updated 5 years ago
- Sokoban environment for OpenAI Gym ☆390 · Updated 2 years ago
- A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartP… ☆121 · Updated last year
- ☆315 · Updated 3 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL ☆385 · Updated 4 years ago
- Simple, readable, yet full-featured implementation of PPO in Pytorch ☆51 · Updated 8 months ago
- Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem" ☆525 · Updated 3 years ago
- Pytorch Implementation of MuZero ☆354 · Updated 2 years ago
- General Python implementation of Monte Carlo Tree Search for the use with Open AI Gym environments. ☆41 · Updated 5 years ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms ☆167 · Updated 2 years ago
- Demo of UCT (MCTS) in Python / Numpy ☆88 · Updated 3 years ago
- PyTorch implementation of SAC-Discrete. ☆314 · Updated last year
- ☆174 · Updated 2 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization" ☆527 · Updated 3 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku ☆216 · Updated 10 months ago
- Code for conservative Q-learning ☆468 · Updated 4 years ago
- Simple A3C implementation with pytorch + multiprocessing ☆658 · Updated 2 years ago
- DQN to play Atari Pong ☆114 · Updated 7 years ago
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning" ☆182 · Updated 2 years ago
- An environment of the board game Go using OpenAI's Gym API ☆177 · Updated 3 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) ☆148 · Updated 7 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO ☆205 · Updated 3 years ago