aijunbai / thompson-sampling
Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs
☆15Updated 8 years ago
Alternatives and similar repositories for thompson-sampling:
Users that are interested in thompson-sampling are comparing it to the libraries listed below
- Generate taylored code for Differential Dynamic Programming (DDP) aka Iterative Linear Quadratic Gaussian (iLQG) solvers for finite time …☆15Updated 6 years ago
- Online solver based on Monte Carlo tree search for POMDPs with continuous state, action, and observation spaces.☆56Updated 8 months ago
- Monte Carlo Tree Search - C++14 implementation☆40Updated last year
- Planning algorithms for problems with uncertain world state and action outcomes (POMDP and MDP models)☆52Updated 3 years ago
- Julia Implementation of the POMCP algorithm for solving POMDPs☆12Updated 3 years ago
- ☆16Updated 3 years ago
- CUDA optimized code for solving MDPs, POMDPs, and Dec-POMDPs.☆18Updated 3 years ago
- The PO-UCT algorithm (aka POMCP) implemented in Julia☆36Updated last week
- CitySim3D: Simulated car following benchmark☆27Updated 2 years ago
- ☆53Updated 7 years ago
- ☆36Updated 2 years ago
- TD-Regularized Actor-Critic Methods☆34Updated 5 years ago
- Stabilizable Nonlinear Dynamics Learning☆21Updated 5 years ago
- Efficient Point-Based POMDP Planning by Approximating☆88Updated 5 years ago
- Scalable MCTS for team scenarios☆15Updated 8 months ago
- ☆43Updated 3 years ago
- Source for Action Schema Networks paper (AAAI'18)☆31Updated last year
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆55Updated 5 years ago
- Iterative Linearized Control Toolbox☆34Updated 6 months ago
- AA120Q Course Materials☆28Updated 3 weeks ago
- Solving POMDPs using exact and approximate methods☆13Updated 7 years ago
- Bayesian Optimal Monte Carlo Planning POMDP solver☆18Updated last year
- A PyTorch implementation of DeepMind's MCTSnet☆18Updated 2 years ago
- My PhD thesis. I defended on the 30th of October, 2020! See https://github.com/eleurent/phd-defense/☆14Updated 3 years ago
- RL agent to play μRTS with Stable-Baselines3 and PyTorch☆26Updated 3 years ago
- Provides visualization tools for AutomotiveDrivingModels. Built on Cairo☆33Updated 4 years ago
- Bayesian Reward Shaping Framework for Deep Reinforcement Learning☆23Updated 5 years ago
- Deep Learning Course Project☆11Updated 7 years ago
- Generic Reinforcement Learning Library☆9Updated 2 months ago
- Accelerating Quadratic Optimization with Reinforcement Learning☆88Updated 3 years ago