albietz / cb_bakeoff

scripts for evaluation of contextual bandit algorithms

☆45

Alternatives and similar repositories for cb_bakeoff:

Users who are interested in cb_bakeoff are comparing it to the libraries listed below.

akshaykr / oracle_cb
Experimentation for oracle based contextual bandit algorithms.
☆31 Updated 2 years ago
banditml / banditml
A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.
☆66 Updated 3 years ago
SoluMilken / Contextual-Bandit
Contextual Bandit Algorithms (+Bandit Algorithms)
☆22 Updated 5 years ago
ntucllab / striatum
Contextual bandit in python
☆111 Updated 3 years ago
iurteaga / bandits
Public repository for the work on bandit problems
☆23 Updated 11 months ago
HCDM / BanditLib
Library of contextual bandits algorithms
☆334 Updated last year
adith387 / slates_semisynth_expts
Semi-synthetic experiments to test several approaches for off-policy evaluation and optimization of slate recommenders.
☆43 Updated 7 years ago
dquail / NonStationaryBandit
Non stationary bandit for experiments with Reinforcement Learning
☆34 Updated 8 years ago
qingyun-wu / NonstationaryBanditLib
☆15 Updated 5 years ago
YRussac / WeightedLinearBandits
Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"
☆17 Updated 5 years ago
allenday / contextual-bandit
working example of a contextual multi-armed bandit
☆55 Updated 5 years ago
kfoofw / bandit_simulations
Bandit algorithms simulations for online learning
☆83 Updated 4 years ago
niffler92 / Bandit
Bandit algorithms
☆29 Updated 7 years ago
LaunchpadAI / space-bandits
☆103 Updated 3 years ago
criteo-research / optimization-continuous-action-crm
☆30 Updated 4 years ago
rjagerman / wsdm2019-nonstationary
Non-stationary Off-policy Evaluation
☆13 Updated 6 years ago
alison-carrera / mabalgs
Multi-Armed Bandit Algorithms Library (MAB)
☆132 Updated 2 years ago
usaito / counterfactual-cv
(ICML2020) “Counterfactual Cross-Validation: Stable Model Selection Procedure for Causal Inference Models”
☆31 Updated 2 years ago
rayshi14 / LinearUCB-python
Python code for the Linear UCB bandit learning algorithm (L. Li, 2010)
☆18 Updated 10 years ago
sauxpa / neural_exploration
Study NeuralUCB and regret analysis for contextual bandit with neural decision
☆92 Updated 3 years ago
ermongroup / best-arm-delayed
Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018.
☆19 Updated 6 years ago
jldbc / bandits
Multi-Armed Bandit algorithms applied to the MovieLens 20M dataset
☆56 Updated 4 years ago
andrewk1 / pytorch-deep-bayesian-bandits
PyTorch port and extension of the Deep Bayesian Bandits Library
☆42 Updated 5 years ago
ondrejbiza / bandits
Comparison of bandit algorithms from the Reinforcement Learning bible.
☆17 Updated 6 years ago
Alanthink / banditpylib
A lightweight python library for bandit algorithms
☆30 Updated 2 years ago
henryslzhao / RL4Recsys
paper list in the area of reinforcement learning for recommendation systems
☆24 Updated 4 years ago
VowpalWabbit / coba
Contextual bandit benchmarking
☆49 Updated 6 months ago
abhi1345 / deep-q-rank
A deep reinforcement learning approach to search engine ranking (PyTorch). Final Project for UC Berkeley's CS 285: Deep Reinforcement Lea…
☆27 Updated 10 months ago
usaito / recsys2021-tutorial
https://sites.google.com/cornell.edu/recsys2021tutorial
☆53 Updated 3 years ago
clvoloshin / constrained_batch_policy_learning
☆26 Updated 5 years ago