aijunbai / thompson-samplingLinks
Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs
☆15Updated 9 years ago
Alternatives and similar repositories for thompson-sampling
Users that are interested in thompson-sampling are comparing it to the libraries listed below
Sorting:
- TD-Regularized Actor-Critic Methods☆36Updated 5 years ago
- ☆54Updated 7 years ago
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆43Updated 2 years ago
- ☆16Updated 4 years ago
- Stabilizable Nonlinear Dynamics Learning☆21Updated 5 years ago
- ☆43Updated 4 years ago
- Generate taylored code for Differential Dynamic Programming (DDP) aka Iterative Linear Quadratic Gaussian (iLQG) solvers for finite time …☆15Updated 7 years ago
- CUDA optimized code for solving MDPs, POMDPs, and Dec-POMDPs.☆18Updated 4 years ago
- Accelerating Quadratic Optimization with Reinforcement Learning☆91Updated 3 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆56Updated 6 years ago
- Online solver based on Monte Carlo tree search for POMDPs with continuous state, action, and observation spaces.☆56Updated last week
- Multi-agent active perception with prediction rewards☆11Updated 4 years ago
- hierarchical deep reinforcement learning algorithms☆41Updated 7 years ago
- Implementation of GAIL and AIRL using chinerrl☆17Updated 3 years ago
- Great resources for learning optimal control☆18Updated 6 years ago
- JAX-based implementation for multi-agent path planning (MAPP) in continuous spaces.☆53Updated 2 years ago
- Scalable MCTS for team scenarios☆16Updated last year
- Paper: Challenges in High-dimensional Reinforcement Learning with Evolution Strategies☆28Updated 3 years ago
- PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces☆16Updated 3 years ago
- Public Release of Plan2vec Implementation in pyTorch☆56Updated 2 years ago
- Comp 781 Project☆9Updated 6 years ago
- Non-linear policy graph improvement - planning for Dec-POMDPs☆16Updated 4 years ago
- Model-based reinforcement learning using CEM, MPC and PETS☆16Updated 5 years ago
- Bayesian Reward Shaping Framework for Deep Reinforcement Learning☆23Updated 6 years ago
- ☆32Updated 6 years ago
- Planning algorithms for problems with uncertain world state and action outcomes (POMDP and MDP models)☆53Updated 3 years ago
- Efficient Point-Based POMDP Planning by Approximating☆90Updated 5 years ago
- CitySim3D: Simulated car following benchmark☆27Updated 2 years ago
- Code for the Black-DROPS algorithm: "Black-Box Data-efficient Policy Search for Robotics", IROS 2017/ICRA 2018☆65Updated 3 years ago
- Google AI Princeton control framework☆38Updated 4 years ago