aijunbai / thompson-sampling
Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs
☆14Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for thompson-sampling
- Stabilizable Nonlinear Dynamics Learning☆21Updated 5 years ago
- Generate taylored code for Differential Dynamic Programming (DDP) aka Iterative Linear Quadratic Gaussian (iLQG) solvers for finite time …☆15Updated 6 years ago
- Planning algorithms for problems with uncertain world state and action outcomes (POMDP and MDP models)☆52Updated 3 years ago
- Online solver based on Monte Carlo tree search for POMDPs with continuous state, action, and observation spaces.☆53Updated 4 months ago
- CUDA optimized code for solving MDPs, POMDPs, and Dec-POMDPs.☆16Updated 3 years ago
- CitySim3D: Simulated car following benchmark☆27Updated last year
- Differentiable Gaussian Process Motion Planning☆46Updated 3 years ago
- ☆53Updated 6 years ago
- Structured framework for learning mechanical systems in PyTorch☆23Updated 5 years ago
- Solving POMDPs using exact and approximate methods☆13Updated 7 years ago
- ☆35Updated last year
- ☆23Updated 4 years ago
- a High-Performance Distributed Solver for Large-Scale Markov Decision Processes (MDP) relying on Inexact Policy Iteration; for Python and…☆19Updated last month
- Simple optimal control framework for python☆13Updated 6 years ago
- Scalable MCTS for team scenarios☆15Updated 4 months ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆54Updated 5 years ago
- AA120Q Course Materials☆28Updated 2 weeks ago
- Non-linear policy graph improvement - planning for Dec-POMDPs☆16Updated 3 years ago
- Efficient Point-Based POMDP Planning by Approximating☆87Updated 4 years ago
- Model-based reinforcement learning using CEM, MPC and PETS☆16Updated 4 years ago
- ☆43Updated 3 years ago
- Code for the ICML 2020 publication "Information Particle Filter Tree: An Online Algorithm for POMDPs with Belief-Based Rewards on Continu…☆12Updated 4 years ago
- Great resources for learning optimal control☆17Updated 5 years ago
- Library for simulation of nonlinear control systems, control design, and Lyapunov-based learning.☆37Updated last year
- Using Pilco algorithm to find a controller for few robotic problems☆43Updated 9 years ago
- The PO-UCT algorithm (aka POMCP) implemented in Julia☆36Updated 4 months ago
- Generic Reinforcement Learning Library☆9Updated this week
- Google AI Princeton control framework☆38Updated 4 years ago
- Derivation and implementation of continuous-time finite-horizon Linear Quadratic Regulator☆22Updated 8 years ago
- Iterative Linearized Control Toolbox☆35Updated 3 months ago