Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs
☆15Jun 20, 2016Updated 10 years ago
Alternatives and similar repositories for thompson-sampling
Users that are interested in thompson-sampling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- General Purpose C++ Implementation for Inference and Learning in Bayesian and Markov Networks☆15Oct 29, 2018Updated 7 years ago
- A library for solving POMDPs☆11Apr 15, 2025Updated last year
- ☆13Mar 26, 2019Updated 7 years ago
- ☆10Aug 17, 2022Updated 3 years ago
- Monte Carlo value iteration for continuous-state POMDPs☆12Sep 3, 2013Updated 12 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆37Jun 26, 2026Updated last week
- hmdp is a C++ library and tools for solving Markov Decision Processes (MDPs) with hybrid discrete and/or continuous state-spaces.☆23Aug 26, 2019Updated 6 years ago
- Hierarchical Online Planning and Reinforcement Learning on Taxi☆32Oct 23, 2017Updated 8 years ago
- Speed profile planning via temporal optimization☆33Apr 1, 2019Updated 7 years ago
- Fast Solution of Optimal Control Problems With L1 Cost☆10Aug 9, 2019Updated 6 years ago
- Markovian State and Action Abstractions for MDPs via Hierarchical MCTS within a POMDP Formulation☆11Jul 26, 2016Updated 9 years ago
- ☆10Feb 28, 2019Updated 7 years ago
- Behavior planner fusing runtime verification on traffic rules with single- and multi-agent Monte Carlo Tree Search☆11Jun 15, 2021Updated 5 years ago
- ☆13Nov 17, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Julia Implementation of the POMCP algorithm for solving POMDPs☆12Aug 6, 2021Updated 4 years ago
- Implementation of gaussian processes and bayesian optimization in tensorflow☆10Feb 14, 2016Updated 10 years ago
- Xcode 6 Project Templates. Missing your "Empty Application" template?☆12Sep 17, 2014Updated 11 years ago
- A repository around the Annual Computer Poker Competition server, for https://github.com/deepmind/open_spiel☆13Apr 13, 2021Updated 5 years ago
- ☆42Oct 18, 2018Updated 7 years ago
- ☆12May 7, 2019Updated 7 years ago
- ☆10Feb 2, 2019Updated 7 years ago
- A PyTorch implementation of PTSA-MCTS from [Accelerating Monte Carlo Tree Search with Probability Tree State Abstraction].☆16Oct 21, 2023Updated 2 years ago
- A Library of MDP algorithms for Artificial Intelligence☆18Jul 16, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- eigen-qld allow to use the QLD QP solver with the Eigen3 library.☆17Jun 16, 2026Updated 2 weeks ago
- Monte Carlo Tree Search with Reinforcement Learning for Motion Planning☆79Sep 23, 2020Updated 5 years ago
- AlphaZero for continuous control tasks☆23Dec 7, 2022Updated 3 years ago
- MPC trajectory tracking + reachability-based collision avoidance for pairwise vehicle interactions☆19Jul 10, 2020Updated 5 years ago
- A Data Converter for Nuplan and VAD(VADv2)☆24Nov 26, 2024Updated last year
- Applying DeepMind's MuZero algorithm to the cart pole environment in gym☆22May 6, 2023Updated 3 years ago
- LSTM based neural network that predicts the state of the vehicle in terms of position and velocity.☆14May 7, 2021Updated 5 years ago
- Classic MCTS example with mctx☆25May 25, 2023Updated 3 years ago
- Python wrapper for ACPC poker bot infrastructure☆13May 20, 2018Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆27May 2, 2025Updated last year
- C++ 极简之路,用繁琐的语言做简单的事。Exploring the simplest and deepest way of Cpp.☆21Feb 6, 2020Updated 6 years ago
- Derivation and implementation of continuous-time finite-horizon Linear Quadratic Regulator☆27Feb 20, 2016Updated 10 years ago
- This was a fork of sphinxcontrib-django, but all changes have been merged into the upstream repository.☆15Mar 2, 2023Updated 3 years ago
- Computationally low-cost interception trajectories for quadrocopters☆27Oct 23, 2019Updated 6 years ago
- drone controls☆16Oct 30, 2020Updated 5 years ago
- The programming assignments from CS228T offered in Spring 2012 at Stanford☆40Oct 16, 2012Updated 13 years ago