tmoer / MCTS-T
Code for the paper 'Monte Carlo Tree Search for Asymmetric Trees'
☆11Updated 7 years ago
Alternatives and similar repositories for MCTS-T
Users who are interested in MCTS-T are comparing it to the repositories listed below
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆22Updated 4 years ago
- A standalone release of DeepMind Lab's maze generator with Python bindings.☆64Updated last year
- ☆16Updated 3 years ago
- Translating HTN planning problems to PDDL☆21Updated 4 years ago
- Scalable MCTS for team scenarios☆16Updated last year
- ☆35Updated 7 years ago
- Map-Elites based on Evolution Strategies☆32Updated 3 years ago
- ☆31Updated 6 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 4 years ago
- A C++ pytorch implementation of MuZero☆40Updated last year
- Generalised UDRL☆37Updated 3 years ago
- A PyTorch implementation of DeepMind's MCTSnet☆18Updated 2 years ago
- Official repository for the paper "Automating Continual Learning"☆16Updated 2 months ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆19Updated 5 years ago
- **Sferes2 module** A unifying modular framework for Quality-Diversity algorithms☆22Updated 4 years ago
- krazy grid world☆25Updated 5 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆28Updated 2 years ago
- Understanding RL vision Distill article☆24Updated 2 years ago
- Demo of UCT (MCTS) in Python / Numpy☆88Updated 2 years ago
- 🤖 Reinforcement Learning paper summaries, notebooks, and articles.☆26Updated 5 years ago
- Neural model of hierarchical reinforcement learning☆16Updated 7 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 5 years ago
- Deep Reinforcement Learning Framework done with PyTorch☆37Updated 5 months ago
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"☆20Updated 2 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆41Updated 2 years ago
- Code for human intervention reinforcement learning☆34Updated 7 years ago
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.☆25Updated 4 years ago
- TD-Regularized Actor-Critic Methods☆36Updated 5 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Updated 6 years ago
- Evaluating different engineering tricks that make RL work☆15Updated 4 years ago