aijunbai / thompson-sampling
Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs
☆15 Updated 9 years ago
Alternatives and similar repositories for thompson-sampling
Users who are interested in thompson-sampling are comparing it to the libraries listed below.
- CUDA optimized code for solving MDPs, POMDPs, and Dec-POMDPs. ☆18 Updated 4 years ago
- ☆16 Updated 4 years ago
- CitySim3D: Simulated car following benchmark ☆27 Updated 2 years ago
- Planning algorithms for problems with uncertain world state and action outcomes (POMDP and MDP models) ☆53 Updated 3 years ago
- Generate tailored code for Differential Dynamic Programming (DDP) aka Iterative Linear Quadratic Gaussian (iLQG) solvers for finite time … ☆15 Updated 7 years ago
- TD-Regularized Actor-Critic Methods ☆36 Updated 5 years ago
- Online solver based on Monte Carlo tree search for POMDPs with continuous state, action, and observation spaces. ☆56 Updated last month
- a High-Performance Distributed Solver for Large-Scale Markov Decision Processes (MDP) relying on Inexact Policy Iteration; for Python and… ☆25 Updated 2 months ago
- ☆54 Updated 7 years ago
- Stabilizable Nonlinear Dynamics Learning ☆21 Updated 5 years ago
- Robustness via Retrying: Closed-Loop Robotic Manipulation with Self-Supervised Learning ☆16 Updated 6 years ago
- This repository contains implementations of the paper VUSFA ☆14 Updated 4 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning. ☆17 Updated 6 years ago
- ☆43 Updated 4 years ago
- Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning". ☆35 Updated 7 years ago
- Transfer Learning Environments ☆11 Updated 7 years ago
- RL agent to play μRTS with Stable-Baselines3 and PyTorch ☆26 Updated 3 years ago
- Code for the ICML 2020 publication "Information Particle Filter Tree: An Online Algorithm for POMDPs with Belief-Based Rewards on Continu… ☆14 Updated 4 years ago
- hierarchical deep reinforcement learning algorithms ☆41 Updated 7 years ago
- Basic Gaussian process regression library. (Eigen3 required) ☆25 Updated 9 years ago
- Great resources for learning optimal control ☆17 Updated 5 years ago
- Differentiable Gaussian Process Motion Planning ☆51 Updated 3 years ago
- Deep Learning Course Project ☆11 Updated 7 years ago
- Scalable MCTS for team scenarios ☆16 Updated last year
- ☆32 Updated 6 years ago
- The PO-UCT algorithm (aka POMCP) implemented in Julia ☆37 Updated 2 months ago
- Jointly learning policies and latent representations for driver behavior. ☆15 Updated 8 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019. ☆56 Updated 6 years ago
- Neuronal Circuit Policies ☆40 Updated 2 years ago
- Efficient Point-Based POMDP Planning by Approximating ☆90 Updated 5 years ago