tmoer / MCTS-T
Code for the paper 'Monte Carlo Tree Search for Asymmetric Trees'
☆12 Updated 7 years ago
Alternatives and similar repositories for MCTS-T
Users who are interested in MCTS-T are comparing it to the repositories listed below:
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a… ☆23 Updated 4 years ago
- A PyTorch implementation of DeepMind's MCTSnet ☆18 Updated 3 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning. ☆17 Updated 7 years ago
- A standalone release of DeepMind Lab's maze generator with Python bindings. ☆67 Updated 2 years ago
- Demo of UCT (MCTS) in Python / Numpy ☆88 Updated 2 years ago
- TD-Regularized Actor-Critic Methods ☆35 Updated 5 years ago
- Different AI algorithms to solve board games ☆18 Updated 7 years ago
- Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs ☆15 Updated 9 years ago
- Implementation of Fast Efficient Hyperparameter Tuning for Policy Gradient Methods (https://arxiv.org/abs/1902.06583) ☆19 Updated 6 years ago
- A Python implementation of tabular MuZero for educational purposes ☆21 Updated 6 years ago
- 🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation ☆38 Updated 3 years ago
- Public Release of Plan2vec Implementation in PyTorch ☆57 Updated 3 years ago
- ☆19 Updated 4 years ago
- Reinforcement learning benchmarking. ☆39 Updated 7 years ago
- Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions (ICML 2020) ☆43 Updated 3 years ago
- A C++ PyTorch implementation of MuZero ☆41 Updated last year
- Official PyTorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration". ☆24 Updated 2 years ago
- Scalable MCTS for team scenarios ☆16 Updated last year
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT) ☆29 Updated 3 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning" ☆39 Updated 4 years ago
- A first bare-bones parallelized implementation of Go-Explore as described by the Uber Engineering blog post ☆46 Updated 6 years ago
- Bridging State and History Representations: Understanding Self-Predictive RL, ICLR 2024 ☆22 Updated last year
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN ☆45 Updated 5 years ago
- Implementation of Proximal Policy Optimization in Jax+Flax ☆21 Updated 2 years ago
- PyTorch - Implicit Quantile Networks - Quantile Regression - C51 ☆22 Updated 6 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in NetHack/MiniHack environm… ☆41 Updated 3 years ago
- ☆43 Updated 4 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation ☆14 Updated 5 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019. ☆55 Updated 6 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations ☆49 Updated 3 years ago