DenseLance / mcts-simple
mcts-simple is a Python3 library that allows reinforcement learning problems to be solved easily with its implementations of Monte Carlo Tree Search.
☆27Updated 9 months ago
Alternatives and similar repositories for mcts-simple:
Users that are interested in mcts-simple are comparing it to the libraries listed below
- Minimal code for A Generalist Agent☆39Updated 2 years ago
- PyTorch Package For Quasimetric Learning☆41Updated 4 months ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆76Updated last year
- AlphaZero for continuous control tasks☆23Updated 2 years ago
- A PyTorch implementation of DeepMind's MCTSnet☆18Updated 2 years ago
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆17Updated last year
- Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.☆104Updated last year
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆74Updated 2 years ago
- General Python implementation of Monte Carlo Tree Search for the use with Open AI Gym environments.☆39Updated 4 years ago
- ☆33Updated this week
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆40Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 7 months ago
- MiniZero: An AlphaZero and MuZero Training Framework☆84Updated last month
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization☆80Updated last year
- [CoRL 2020] COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning☆32Updated 4 years ago
- ☆53Updated 4 months ago
- Implementation of Decision Stacks: Flexible RL via Modular Generative Models [NeurIPS 2023]☆12Updated last year
- Deep Reinforcement Learning Framework done with PyTorch☆34Updated 2 weeks ago
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)☆66Updated 3 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆116Updated 3 years ago
- A2C is a special case of PPO!☆19Updated 2 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆62Updated last year
- A C++ pytorch implementation of MuZero☆36Updated 10 months ago
- Code for Posterior Sampling for Deep Reinforcement Learning, ICML 2023☆25Updated last year
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆40Updated 2 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆12Updated last year
- An implementation of PPO in Pytorch☆69Updated last month
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆38Updated 4 years ago
- Implementation of Multi-Game Decision Transformers in PyTorch☆46Updated 2 years ago