CyCTW / Parallel-MCTS
Parallel Monte Carlo Tree Search. See README.md for detailed usage and information.
☆40 · Updated 3 years ago
Related projects
Alternatives and complementary repositories for Parallel-MCTS
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆110 · Updated 3 years ago
- ☆24 · Updated 2 years ago
- A2C training of Relational Deep Reinforcement Learning Architecture ☆13 · Updated 2 years ago
- Code for Learning to Synthesize Programs as Interpretable and Generalizable Policies in NeurIPS 2021 ☆33 · Updated 2 years ago
- MiniZero: An AlphaZero and MuZero Training Framework ☆72 · Updated 3 weeks ago
- ☆48 · Updated last year
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon" ☆42 · Updated 4 months ago
- An implementation of MuZero in JAX. ☆53 · Updated 2 years ago
- QuaRL is an open-source framework for systematically studying the effect of applying quantization to reinforcement learning algorithms. ☆65 · Updated last year
- General Python implementation of Monte Carlo Tree Search for the use with Open AI Gym environments. ☆35 · Updated 4 years ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e… ☆70 · Updated 11 months ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games ☆46 · Updated 2 months ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t… ☆44 · Updated 4 years ago
- PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning ☆49 · Updated 3 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)] ☆87 · Updated 11 months ago
- fast + parallel AlphaZero in JAX ☆84 · Updated 7 months ago
- Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291. ☆94 · Updated last year
- an environment based on XLA for deep learning compiler optimization research. ☆23 · Updated last year
- Simple, readable, yet full-featured implementation of PPO in Pytorch ☆44 · Updated 2 years ago
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation ☆72 · Updated last year
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration ☆65 · Updated 3 years ago
- PyTorch implementation for the Deep Symbolic Simplification Without Human Knowledge ☆14 · Updated 3 years ago
- [ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control ☆101 · Updated 4 years ago
- Standard interface for entity based reinforcement learning environments. ☆36 · Updated 8 months ago
- ☆33 · Updated 2 months ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py). ☆29 · Updated 2 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning" ☆37 · Updated 3 years ago
- PPO with multi-head/autoregressive action outputs ☆36 · Updated 3 years ago
- ☆34 · Updated last year