Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs
☆15Jun 20, 2016Updated 9 years ago
Alternatives and similar repositories for thompson-sampling
Users that are interested in thompson-sampling are comparing it to the libraries listed below.
- Robot User Manuals☆16Dec 11, 2025Updated 3 months ago
- MATLAB toolbox for stochastic reachability (probabilistic verification and controller synthesis)☆12Sep 25, 2020Updated 5 years ago
- ☆10Aug 17, 2022Updated 3 years ago
- Aligntune: A Modular Toolkit for Post Training Alignment of LLMs☆37Updated this week
- ☆17Mar 21, 2021Updated 5 years ago
- Hierarchical Online Planning and Reinforcement Learning on Taxi☆32Oct 23, 2017Updated 8 years ago
- Speed profile planning via temporal optimization☆32Apr 1, 2019Updated 6 years ago
- Fast Solution of Optimal Control Problems With L1 Cost☆10Aug 9, 2019Updated 6 years ago
- Trajectory tracking control for wheeled mobile robots in a robot soccer field using Fuzzy Logic.☆12Jul 18, 2021Updated 4 years ago
- ☆10Feb 28, 2019Updated 7 years ago
- Behavior planner fusing runtime verification on traffic rules with single- and multi-agent Monte Carlo Tree Search☆11Jun 15, 2021Updated 4 years ago
- ☆13Nov 17, 2021Updated 4 years ago
- Julia Implementation of the POMCP algorithm for solving POMDPs☆12Aug 6, 2021Updated 4 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Aug 4, 2020Updated 5 years ago
- WIP: regular expressions for identifying and extracting values from HGVS nomenclature☆14Apr 15, 2018Updated 7 years ago
- Xcode 6 Project Templates. Missing your "Empty Application" template?☆12Sep 17, 2014Updated 11 years ago
- A repository around the Annual Computer Poker Competition server, for https://github.com/deepmind/open_spiel☆13Apr 13, 2021Updated 4 years ago
- Official code for paper: INT: An Inequality Benchmark for Evaluating Generalization in Theorem Proving☆39Dec 12, 2022Updated 3 years ago
- ☆41Oct 18, 2018Updated 7 years ago
- Basic React App built with Bazel☆11Feb 15, 2018Updated 8 years ago
- Multi-agent Monte Carlo Tree Search implementation in C++☆16Feb 10, 2022Updated 4 years ago
- ☆10Feb 2, 2019Updated 7 years ago
- Monte Carlo Tree Search with Reinforcement Learning for Motion Planning☆81Sep 23, 2020Updated 5 years ago
- Notebook from my blog☆15Apr 9, 2017Updated 8 years ago
- AlphaZero for continuous control tasks☆23Dec 7, 2022Updated 3 years ago
- POSQ controller for differential drive robots. It generates the trajectory which brings a differential drive robot with wheelbase of size…☆18Oct 7, 2015Updated 10 years ago
- Autocalibration method for accelerometer data☆12Jan 31, 2019Updated 7 years ago
- LSTM based neural network that predicts the state of the vehicle in terms of position and velocity.☆14May 7, 2021Updated 4 years ago
- Python wrapper for ACPC poker bot infrastructure☆13May 20, 2018Updated 7 years ago
- The minimalist path of C++: doing simple things with a cumbersome language. Exploring the simplest and deepest way of Cpp.☆21Feb 6, 2020Updated 6 years ago
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆26May 2, 2025Updated 10 months ago
- An efficient path smoothing algorithm that has analytic solution. This algorithm provides curvature continuous path using cubic Bezier cu…☆22Apr 27, 2018Updated 7 years ago
- Derivation and implementation of continuous-time finite-horizon Linear Quadratic Regulator☆27Feb 20, 2016Updated 10 years ago
- This was a fork of sphinxcontrib-django, but all changes have been merged into the upstream repository.☆15Mar 2, 2023Updated 3 years ago
- Computationally low-cost interception trajectories for quadrocopters☆27Oct 23, 2019Updated 6 years ago
- drone controls☆16Oct 30, 2020Updated 5 years ago
- The programming assignments from CS228T offered in Spring 2012 at Stanford☆40Oct 16, 2012Updated 13 years ago
- Trajectory utilities for MAVs☆30Jul 16, 2017Updated 8 years ago
- ☆15May 4, 2024Updated last year