andrecianflone / thompsonLinks

Thompson Sampling Tutorial

☆53

Alternatives and similar repositories for thompson

Users that are interested in thompson are comparing it to the libraries listed below

Sorting:

sauxpa / neural_exploration
Study NeuralUCB and regret analysis for contextual bandit with neural decision
☆95Updated 3 years ago
akhadangi / Multi-armed-Bandits
In this notebook several classes of multi-armed bandits are implemented. This includes epsilon greedy, UCB, Linear UCB (Contextual bandit…
☆87Updated 4 years ago
dquail / NonStationaryBandit
Non stationary bandit for experiments with Reinforcement Learning
☆34Updated 8 years ago
kfoofw / bandit_simulations
Bandit algorithms simulations for online learning
☆86Updated 5 years ago
chariff / GPro
Python package for Preference Learning with Gaussian Processes.
☆33Updated 3 years ago
causal-rl-anonymous / causal-rl
☆43Updated 3 years ago
thanhnguyentang / offline_neural_bandits
An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR…
☆14Updated 3 years ago
YRussac / WeightedLinearBandits
Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"
☆17Updated 5 years ago
henryslzhao / RL4Recsys
paper list in the area of reinforcenment learning for recommendation systems
☆24Updated 4 years ago
siemens / industrialbenchmark
Industrial Benchmark
☆129Updated 2 years ago
ntucllab / striatum
Contextual bandit in python
☆114Updated 3 years ago
RonyAbecidan / Neural-Thompson-Sampling
Study of the paper 'Neural Thompson Sampling' published in October 2020
☆22Updated 2 years ago
iankurgarg / Reinforcement-Learning-Feature-Selection
Feature selection for maximizing expected cumulative reward
☆30Updated 7 years ago
jldbc / bandits
Multi-Armed Bandit algorithms applied to the MovieLens 20M dataset
☆56Updated 4 years ago
debmandal / RL-Causality
References at the Intersection of Causality and Reinforcement Learning
☆89Updated 4 years ago
dtak / mbrl-smdp-ode
PyTorch implementation of "Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs", NeurIPS 2020
☆42Updated 4 years ago
aviralkumar2907 / BEAR
Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
☆161Updated 4 years ago
CausalRL / DRL
Deconfounding Reinforcement Learning in Observational Settings
☆52Updated 6 years ago
mcgillmrl / prob_mbrl
A library of probabilistic model based RL algorithms in pytorch
☆107Updated 4 years ago
thanard / me-trpo
☆92Updated last year
johannah / bootstrap_dqn
Implementation of Bootstrap DQN and Randomized Prior Functions on ALE
☆55Updated 3 months ago
clvoloshin / COBS
OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.
☆61Updated 2 years ago
kazizzad / BDQN-MxNet-Gluon
Efficient Exploration through Bayesian Deep Q-Networks
☆37Updated 7 years ago
yfletberliac / rlss-2019
Materials for the Practical Sessions of the Reinforcement Learning Summer School 2019: Bandits, RL & Deep RL (PyTorch).
☆88Updated 5 years ago
johannesnauta / pytorch-pne
PyTorch implementation of Probabilistic Network Ensembles on toy problems
☆23Updated 2 years ago
uclaml / NeuralUCB
☆34Updated 4 years ago
google-research / deep_ope
☆86Updated 10 months ago
qlan3 / Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
☆92Updated last week
nathangrinsztajn / Box-World
Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning"
☆46Updated last year
archsyscall / DistRL-TensorFlow2
🐳 Implementation of various Distributional Reinforcement Learning Algorithms using TensorFlow2.
☆69Updated 4 years ago