DenseLance / mcts-simpleLinks

mcts-simple is a Python3 library that allows reinforcement learning problems to be solved easily with its implementations of Monte Carlo Tree Search.

☆27

Alternatives and similar repositories for mcts-simple

Users that are interested in mcts-simple are comparing it to the libraries listed below

Sorting:

PatrickKorus / mcts-general
General Python implementation of Monte Carlo Tree Search for the use with Open AI Gym environments.
☆40Updated 4 years ago
bhansconnect / fast-alphazero-general
A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general
☆44Updated 2 years ago
MarcoMeter / neroRL
Deep Reinforcement Learning Framework done with PyTorch
☆36Updated 3 months ago
tmoer / a0c
Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)
☆15Updated 4 years ago
HumanCompatibleAI / learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
☆30Updated 3 years ago
icaros-usc / dqd-rl
Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"
☆20Updated 2 years ago
faameunier / MCTSnet
A PyTorch implementation of DeepMind's MCTSnet
☆18Updated 2 years ago
yobibyte / amorpheus
My Body Is A Cage
☆41Updated 4 years ago
toshikwa / rljax
A collection of RL algorithms written in JAX.
☆98Updated 2 years ago
vwxyzjn / a2c_is_a_special_case_of_ppo
A2C is a special case of PPO!
☆22Updated 3 years ago
daniellawson9999 / online-decision-transformer
An unofficial implementation for online decision transformer
☆40Updated 2 years ago
montrealrobotics / iv_rl
IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
☆39Updated 7 months ago
BY571 / IQN-and-Extensions
PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…
☆90Updated 2 years ago
quasimetric-learning / torch-quasimetric
PyTorch Package For Quasimetric Learning
☆42Updated 7 months ago
quantumiracle / MARS
MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.
☆49Updated last year
remosasso / PSDRL
Code for Posterior Sampling for Deep Reinforcement Learning, ICML 2023
☆27Updated last year
salesforce / sibling-rivalry
Code for Sibling Rivalry and experiments presented in associated paper
☆17Updated last month
twni2016 / Memory-RL
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)
☆62Updated last year
etaoxing / multigame-dt
Implementation of Multi-Game Decision Transformers in PyTorch
☆47Updated 2 years ago
ollebompa / PGA-MAP-Elites
Repository for the PGA-MAP-Elites algorithm. PGA-MAP-Elites was developed to efficiently scale MAP-Elites to large genotypes and noisy d…
☆57Updated 3 years ago
hr0nix / omega
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…
☆41Updated 2 years ago
ollebompa / QDgym
Repository for the QDgym code. A framework for Quality Diversity optimization benchmark tasks based OpenAI Gym.
☆23Updated 4 years ago
tianjunz / NovelD
☆40Updated 3 years ago
liuanji / WU-UCT
A novel parallel UCT algorithm with linear speedup and negligible performance loss.
☆120Updated 4 years ago
google-research / pisac
Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)
☆44Updated 2 years ago
pairlab / d2rl
Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"
☆39Updated 4 years ago
YuhangSong / Arena-Baselines
Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.
☆103Updated 3 months ago
qgallouedec / lge
☆31Updated last year
neka-nat / distributed_rl
Pytorch implementation of distributed deep reinforcement learning
☆76Updated 2 years ago
jianzhnie / RLZero
A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.
☆16Updated 8 months ago