sergeim19 / SinglePlayerMCTS
Single Player Monte Carlo Tree Search implementation
☆18 · Updated 5 years ago
Alternatives and similar repositories for SinglePlayerMCTS
Users who are interested in SinglePlayerMCTS are comparing it to the repositories listed below.
- Hierarchical Q-learning implementation ☆11 · Updated 9 years ago
- Unofficial implementation of GAN Q-Learning https://arxiv.org/abs/1805.04874 ☆47 · Updated 4 years ago
- A collection of multi-agent reinforcement learning OpenAI Gym environments ☆45 · Updated 4 years ago
- Distributed implementation of popular evolutionary methods ☆64 · Updated 7 years ago
- ☆35 · Updated 6 years ago
- A simple Gridworld environment for OpenAI Gym ☆25 · Updated 6 years ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M… ☆79 · Updated 6 years ago
- Distributed Deep Reinforcement Learning ☆29 · Updated 4 years ago
- An implementation of Monte Carlo Tree Search in Python ☆162 · Updated 4 years ago
- Random Network Distillation (RND) algorithm in PyTorch ☆49 · Updated 6 years ago
- Hierarchical deep reinforcement learning algorithms ☆41 · Updated 7 years ago
- Logarithmic Reinforcement Learning ☆26 · Updated 2 years ago
- Fictitious Self-play & Reinforcement Learning ☆18 · Updated 7 years ago
- PyTorch implementation of Proximal Policy Optimization ☆51 · Updated 7 years ago
- Demo of UCT (MCTS) in Python / NumPy ☆86 · Updated 2 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019 ☆47 · Updated 6 years ago
- PyTorch implementation of CommNet ☆36 · Updated 7 years ago
- Value iteration, policy iteration, and Q-Learning in a grid-world MDP. ☆29 · Updated last year
- A test implementation of the IMPALA algorithm (by DeepMind, 2018) ☆35 · Updated 7 years ago
- A Python implementation of tabular MuZero for educational purposes ☆21 · Updated 5 years ago
- Simple implementation of the regret matching algorithm for RPS Nash equilibrium computation via self-play ☆25 · Updated 6 years ago
- A first bare-bones parallel implementation of Go-Explore as described by the Uber Engineering blog post ☆46 · Updated 6 years ago
- Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation ☆87 · Updated 7 years ago
- Code for experimenting with state and action abstractions in reinforcement learning. ☆31 · Updated 4 years ago
- Reimplementation of the DDPG algorithm using TensorFlow ☆38 · Updated 8 years ago
- TD-Regularized Actor-Critic Methods ☆36 · Updated 5 years ago
- ☆77 · Updated 7 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms ☆95 · Updated 4 years ago
- Implementation of Multi-Agent Deep Deterministic Policy Gradients ☆38 · Updated 7 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand. ☆49 · Updated 4 years ago