aijunbai / thompson-samplingLinks
Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs
☆15Updated 9 years ago
Alternatives and similar repositories for thompson-sampling
Users that are interested in thompson-sampling are comparing it to the libraries listed below
Sorting:
- Stabilizable Nonlinear Dynamics Learning☆23Updated 6 years ago
- ☆43Updated 4 years ago
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆43Updated 3 years ago
- JAX-based implementation for multi-agent path planning (MAPP) in continuous spaces.☆54Updated 3 years ago
- TD-Regularized Actor-Critic Methods☆36Updated 6 years ago
- Accelerating Quadratic Optimization with Reinforcement Learning☆94Updated 4 years ago
- a High-Performance Distributed Solver for Large-Scale Markov Decision Processes (MDP) relying on Inexact Policy Iteration; for Python and…☆25Updated 10 months ago
- Online solver based on Monte Carlo tree search for POMDPs with continuous state, action, and observation spaces.☆59Updated 2 months ago
- ☆38Updated 3 years ago
- Code for the paper Model-Predictive Control via Cross-Entropy and Gradient-Based Optimization☆68Updated 5 years ago
- CUDA optimized code for solving MDPs, POMDPs, and Dec-POMDPs.☆18Updated 4 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆55Updated 6 years ago
- ☆17Updated 4 years ago
- Public Release of Plan2vec Implementation in pyTorch☆57Updated 3 years ago
- Multi-agent active perception with prediction rewards☆11Updated 5 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆19Updated 6 years ago
- Bayesian Reward Shaping Framework for Deep Reinforcement Learning☆25Updated 6 years ago
- Performant, differentiable reinforcement learning☆125Updated 6 months ago
- Co-training for Policy Learning☆13Updated 6 years ago
- RL agent to play μRTS with Stable-Baselines3 and PyTorch☆27Updated 4 years ago
- Performant, differentiable reinforcement learning☆23Updated 2 years ago
- ☆14Updated 2 years ago
- A PyTorch implementation of DeepMind's MCTSnet☆18Updated 3 years ago
- CitySim3D: Simulated car following benchmark☆27Updated 3 years ago
- PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces☆15Updated 4 years ago
- OpenAi's gym environment wrapper to vectorize them with Ray☆23Updated 2 years ago
- Differentiable Gaussian Process Motion Planning☆51Updated 4 years ago
- Hierarchical Reinforcement Learning (batteries included)☆48Updated 6 years ago
- Creating fixed-length vectors to describe RL/GA policies☆20Updated 4 years ago
- Enforcing robust control guarantees within neural network policies☆56Updated 4 years ago