aijunbai / thompson-sampling
Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs
☆15Updated 8 years ago
Alternatives and similar repositories for thompson-sampling:
Users that are interested in thompson-sampling are comparing it to the libraries listed below
- Planning algorithms for problems with uncertain world state and action outcomes (POMDP and MDP models)☆52Updated 3 years ago
- CUDA optimized code for solving MDPs, POMDPs, and Dec-POMDPs.☆18Updated 3 years ago
- PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces☆15Updated 3 years ago
- Solving POMDPs using exact and approximate methods☆13Updated 7 years ago
- ☆53Updated 6 years ago
- Generate taylored code for Differential Dynamic Programming (DDP) aka Iterative Linear Quadratic Gaussian (iLQG) solvers for finite time …☆15Updated 6 years ago
- Stabilizable Nonlinear Dynamics Learning☆21Updated 5 years ago
- ☆36Updated last year
- Non-linear policy graph improvement - planning for Dec-POMDPs☆16Updated 3 years ago
- Bayesian Reward Shaping Framework for Deep Reinforcement Learning☆23Updated 5 years ago
- ☆16Updated 3 years ago
- TD-Regularized Actor-Critic Methods☆34Updated 5 years ago
- The PO-UCT algorithm (aka POMCP) implemented in Julia☆36Updated 7 months ago
- Scalable MCTS for team scenarios☆15Updated 7 months ago
- Structured framework for learning mechanical systems in PyTorch☆24Updated 5 years ago
- Great resources for learning optimal control☆17Updated 5 years ago
- CitySim3D: Simulated car following benchmark☆27Updated 2 years ago
- Online solver based on Monte Carlo tree search for POMDPs with continuous state, action, and observation spaces.☆56Updated 7 months ago
- ☆47Updated 5 years ago
- Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".☆34Updated 6 years ago
- Differentiable Gaussian Process Motion Planning☆46Updated 3 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆17Updated 5 years ago
- Using Pilco algorithm to find a controller for few robotic problems☆43Updated 9 years ago
- This is the accompannying code for the paper "SLAM-Safe Planner: Preventing Monocular SLAM Failure using Reinforcement Learning" and "Dat…☆16Updated 7 years ago
- ☆23Updated 4 years ago
- Comp 781 Project☆8Updated 6 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆55Updated 5 years ago
- ☆11Updated 6 years ago
- Performant, differentiable reinforcement learning☆25Updated last year
- Accelerating Quadratic Optimization with Reinforcement Learning☆87Updated 3 years ago