aijunbai / thompson-samplingLinks
Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs
☆15Updated 9 years ago
Alternatives and similar repositories for thompson-sampling
Users that are interested in thompson-sampling are comparing it to the libraries listed below
Sorting:
- CUDA optimized code for solving MDPs, POMDPs, and Dec-POMDPs.☆18Updated 4 years ago
- Stabilizable Nonlinear Dynamics Learning☆22Updated 5 years ago
- ☆16Updated 4 years ago
- Public Release of Plan2vec Implementation in pyTorch☆57Updated 2 years ago
- Online solver based on Monte Carlo tree search for POMDPs with continuous state, action, and observation spaces.☆56Updated last month
- ☆43Updated 4 years ago
- a High-Performance Distributed Solver for Large-Scale Markov Decision Processes (MDP) relying on Inexact Policy Iteration; for Python and…☆26Updated 4 months ago
- RL agent to play μRTS with Stable-Baselines3 and PyTorch☆26Updated 3 years ago
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆43Updated 2 years ago
- PhD Publications and Thesis on LASSO Model Predictive Control☆20Updated 6 years ago
- Performant, differentiable reinforcement learning☆24Updated 2 years ago
- Accelerating Quadratic Optimization with Reinforcement Learning☆92Updated 3 years ago
- A PyTorch implementation of DeepMind's MCTSnet☆18Updated 2 years ago
- Great resources for learning optimal control☆18Updated 6 years ago
- ☆36Updated 2 years ago
- Differentiable Gaussian Process Motion Planning☆51Updated 3 years ago
- JAX-based implementation for multi-agent path planning (MAPP) in continuous spaces.☆53Updated 2 years ago
- ☆54Updated 7 years ago
- TD-Regularized Actor-Critic Methods☆36Updated 5 years ago
- Neuronal Circuit Policies☆40Updated 3 years ago
- My PhD thesis. I defended on the 30th of October, 2020! See https://github.com/eleurent/phd-defense/☆15Updated 3 years ago
- Code for the ICML 2020 publication "Information Particle Filter Tree: An Online Algorithm for POMDPs with Belief-Based Rewards on Continu…☆14Updated 5 years ago
- Generate taylored code for Differential Dynamic Programming (DDP) aka Iterative Linear Quadratic Gaussian (iLQG) solvers for finite time …☆15Updated 7 years ago
- Co-training for Policy Learning☆13Updated 6 years ago
- Performant, differentiable reinforcement learning☆122Updated last week
- Online variational GPs☆37Updated 2 years ago
- Implicit Differentiable Optimal Control (IDOC) with JAX☆12Updated 3 years ago
- PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces☆15Updated 3 years ago
- Code for the paper Model-Predictive Control via Cross-Entropy and Gradient-Based Optimization☆68Updated 5 years ago
- Structured framework for learning mechanical systems in PyTorch☆26Updated 6 years ago