sergeim19 / SinglePlayerMCTS
Single Player Monte Carlo Tree Search implementation
☆18 Updated 5 years ago
Alternatives and similar repositories for SinglePlayerMCTS:
Users who are interested in SinglePlayerMCTS are comparing it to the repositories listed below.
- Code for experimenting with state and action abstractions in reinforcement learning. ☆30 Updated 4 years ago
- Fictitious Self-play & Reinforcement Learning ☆18 Updated 7 years ago
- Hindsight Experience Replay - Bit flipping experiment in TensorFlow ☆58 Updated 6 years ago
- A simple reinforcement learning simulation engine for OpenAI's Gym. ☆38 Updated 6 years ago
- Demo of UCT (MCTS) in Python / NumPy ☆85 Updated 2 years ago
- ☆35 Updated 6 years ago
- 🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation ☆38 Updated 3 years ago
- A simple Gridworld environment for OpenAI Gym ☆25 Updated 6 years ago
- Random Network Distillation (RND) algorithm in PyTorch ☆48 Updated 6 years ago
- Hierarchical Q-learning implementation ☆11 Updated 9 years ago
- Distributed implementation of popular evolutionary methods ☆64 Updated 7 years ago
- A grid world platform that supports up to 1 million reinforcement-learning agents. ☆69 Updated 7 years ago
- Hierarchical deep reinforcement learning algorithms ☆41 Updated 7 years ago
- Reproducing results from DeepMind's paper on Population Based Training of Neural Networks. ☆56 Updated 6 years ago
- A TensorFlow implementation of the Option-Critic Architecture ☆71 Updated 7 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019 ☆47 Updated 5 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms ☆95 Updated 4 years ago
- Rainbow in TensorFlow ☆49 Updated 6 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments ☆51 Updated 6 months ago
- ☆36 Updated 8 years ago
- Logarithmic Reinforcement Learning ☆26 Updated last year
- ☆44 Updated 6 years ago
- PyRL - Reinforcement Learning Framework in PyTorch (Policy Gradient, DQN, DDPG, TD3, PPO, SAC, etc.) ☆34 Updated 2 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments ☆45 Updated 4 years ago
- A Python implementation of tabular MuZero for educational purposes ☆21 Updated 5 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x ☆62 Updated 3 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning ☆27 Updated 5 years ago
- Actor Critic using Kronecker-Factored Trust Region ☆19 Updated 6 years ago
- FEN Code ☆37 Updated 5 years ago
- Inferring beliefs about dynamics from behavior ☆29 Updated 6 years ago