aijunbai / thompson-sampling
Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs
☆15Updated 8 years ago
Alternatives and similar repositories for thompson-sampling:
Users that are interested in thompson-sampling are comparing it to the libraries listed below
- ☆36Updated 2 years ago
- Generate taylored code for Differential Dynamic Programming (DDP) aka Iterative Linear Quadratic Gaussian (iLQG) solvers for finite time …☆15Updated 7 years ago
- Stabilizable Nonlinear Dynamics Learning☆21Updated 5 years ago
- ☆54Updated 7 years ago
- CUDA optimized code for solving MDPs, POMDPs, and Dec-POMDPs.☆18Updated 3 years ago
- Great resources for learning optimal control☆17Updated 5 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆56Updated 5 years ago
- ☆16Updated 4 years ago
- a High-Performance Distributed Solver for Large-Scale Markov Decision Processes (MDP) relying on Inexact Policy Iteration; for Python and…☆25Updated last week
- Scalable MCTS for team scenarios☆16Updated 10 months ago
- TD-Regularized Actor-Critic Methods☆36Updated 5 years ago
- ☆43Updated 4 years ago
- A PyTorch implementation of DeepMind's MCTSnet☆18Updated 2 years ago
- Performant, differentiable reinforcement learning☆25Updated last year
- PhD Publications and Thesis on LASSO Model Predictive Control☆20Updated 5 years ago
- Differentiable Gaussian Process Motion Planning☆48Updated 3 years ago
- Planning algorithms for problems with uncertain world state and action outcomes (POMDP and MDP models)☆53Updated 3 years ago
- Online solver based on Monte Carlo tree search for POMDPs with continuous state, action, and observation spaces.☆56Updated 10 months ago
- Online variational GPs☆32Updated last year
- Library for simulation of nonlinear control systems, control design, and Lyapunov-based learning.☆40Updated 2 years ago
- My PhD thesis. I defended on the 30th of October, 2020! See https://github.com/eleurent/phd-defense/☆14Updated 3 years ago
- Google AI Princeton control framework☆38Updated 4 years ago
- Code for the ICML 2020 publication "Information Particle Filter Tree: An Online Algorithm for POMDPs with Belief-Based Rewards on Continu…☆13Updated 4 years ago
- OpenAI Gym environment for DART robotics simulator.☆22Updated 7 years ago
- CitySim3D: Simulated car following benchmark☆27Updated 2 years ago
- Bayesian Reward Shaping Framework for Deep Reinforcement Learning☆23Updated 6 years ago
- Using Pilco algorithm to find a controller for few robotic problems☆43Updated 9 years ago
- Iterative Linearized Control Toolbox☆36Updated 8 months ago
- Starter analysis suite for OracleNet path planner☆24Updated 6 years ago
- Accelerating Quadratic Optimization with Reinforcement Learning☆89Updated 3 years ago