tmoer / MCTS-T
Code for the paper 'Monte Carlo Tree Search for Asymmetric Trees'
☆11 · Updated 6 years ago
Alternatives and similar repositories for MCTS-T:
Users who are interested in MCTS-T are comparing it to the repositories listed below.
- Scalable MCTS for team scenarios ☆16 · Updated 10 months ago
- different AI algorithms to solve board games ☆18 · Updated 6 years ago
- Development of a virtual quadruped robot using OpenAI & Mujoco ☆16 · Updated 2 years ago
- Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs ☆15 · Updated 8 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a… ☆21 · Updated 4 years ago
- A PyTorch implementation of DeepMind's MCTSnet ☆18 · Updated 2 years ago
- Neural model of hierarchical reinforcement learning ☆16 · Updated 7 years ago
- Paper: Challenges in High-dimensional Reinforcement Learning with Evolution Strategies ☆28 · Updated 2 years ago
- Official PyTorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration". ☆24 · Updated 2 years ago
- A repository for code of reinforcement learning algorithms with PyTorch ☆30 · Updated 3 years ago
- Comp 781 Project ☆9 · Updated 6 years ago
- Map-Elites based on Evolution Strategies ☆31 · Updated 3 years ago
- The official implementation of Memory-efficient DQN algorithm. ☆10 · Updated last year
- Understanding RL vision Distill article ☆23 · Updated 2 years ago
- Implementation of my Bayesian Optimization algorithms ☆12 · Updated 7 years ago
- Official repository for the paper "Automating Continual Learning" ☆14 · Updated last year
- Python Robot Simulator ☆19 · Updated 6 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post ☆46 · Updated 6 years ago
- My PhD thesis. I defended on the 30th of October, 2020! See https://github.com/eleurent/phd-defense/ ☆14 · Updated 3 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019. ☆56 · Updated 5 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583 ☆19 · Updated 5 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization ☆24 · Updated 4 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper. ☆38 · Updated 4 years ago
- a lightweight implementation of Cartesian genetic programming with symbolic regression in mind. ☆23 · Updated 5 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning ☆27 · Updated 6 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards. ☆10 · Updated 7 years ago
- A Python implementation of tabular MuZero for educational purposes ☆21 · Updated 5 years ago
- PyTorch - Implicit Quantile Networks - Quantile Regression - C51 ☆22 · Updated 5 years ago