abietti/cb_bakeoff

scripts for evaluation of contextual bandit algorithms

☆46

Alternatives and similar repositories for cb_bakeoff

Users that are interested in cb_bakeoff are comparing it to the libraries listed below.

sgiguere / RobinHood-NeurIPS-2019
Implementation of safe offline bandit algorithms.
☆10 · Oct 27, 2019 · Updated 6 years ago
VowpalWabbit / estimators
Estimators to perform off-policy evaluation
☆13 · Sep 3, 2024 · Updated last year
qingyun-wu / NonstationaryBanditLib
☆15 · Jan 20, 2020 · Updated 6 years ago
ermongroup / best-arm-delayed
Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018.
☆20 · Apr 3, 2018 · Updated 8 years ago
david-cortes / contextualbandits
Python implementations of contextual bandits algorithms
☆838 · Jun 28, 2026 · Updated last month
adith387 / slates_semisynth_expts
Semi-synthetic experiments to test several approaches for off-policy evaluation and optimization of slate recommenders.
☆43 · Nov 2, 2017 · Updated 8 years ago
VowpalWabbit / py-vowpal-wabbit-next
Experimental new Python bindings for the VowpalWabbit library
☆12 · Oct 5, 2023 · Updated 2 years ago
actionml / contextual-bandit
☆20 · Mar 15, 2017 · Updated 9 years ago
daturkel / sd_bandits
☆15 · Dec 14, 2020 · Updated 5 years ago
usaito / recsys2021-tutorial
https://sites.google.com/cornell.edu/recsys2021tutorial
☆58 · Mar 21, 2022 · Updated 4 years ago
rayshi14 / LinearUCB-python
Linear UCB bandit learning algorithm L Li(2010) python code
☆19 · Oct 6, 2014 · Updated 11 years ago
VowpalWabbit / jupyter-notebooks
Vowpal Wabbit examples and tutorials
☆21 · Jan 20, 2022 · Updated 4 years ago
HCDM / BanditLib
Library of contextual bandits algorithms
☆343 · Mar 14, 2024 · Updated 2 years ago
voot-t / guide-actor-critic
Keras implementation of guide actor-critic for continuous control
☆11 · Mar 12, 2018 · Updated 8 years ago
umeshksingla / news-recommend-ire
Stream Data based News Recommendation - Contextual Bandit Approach
☆47 · Nov 15, 2017 · Updated 8 years ago
sony / pyIEOE
☆32 · Feb 21, 2025 · Updated last year
thejat / scalable-data-driven-assortment-planning
Code accompanying paper titled "Optimizing Revenue over Data-driven Assortments" (2021)
☆26 · Aug 7, 2021 · Updated 4 years ago
usaito / dr-ranking-metric
(RecSys2020) "Doubly Robust Estimator for Ranking Metrics with Post-Click Conversions"
☆23 · Mar 25, 2023 · Updated 3 years ago
google-research / deep_ope
☆88 · Jul 30, 2024 · Updated last year
finnhacks42 / causal_bandits
☆17 · Oct 25, 2016 · Updated 9 years ago
counterfactual-ml / kdd2022-tutorial
Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances
☆12 · Aug 14, 2022 · Updated 3 years ago
akshaykr / oracle_cb
Experimentation for oracle based contextual bandit algorithms.
☆33 · Sep 12, 2022 · Updated 3 years ago
fated / libcp
LibCP -- A Library for Conformal Prediction
☆13 · Feb 26, 2015 · Updated 11 years ago
sanghack81 / SCMMAB-NIPS2018
Structural Causal Bandit
☆27 · Jun 27, 2026 · Updated last month
DBaudry / Information_Directed_Sampling
Implementation of Russo and Van Roy work on Information Directed Sampling (2017)
☆21 · Jan 18, 2019 · Updated 7 years ago
SuReLI / llrl
Lipschitz Lifelong RL
☆11 · Nov 6, 2020 · Updated 5 years ago
hpclab / LtR-Tutorial
Efficiency/Effectiveness Trade-offs in Learning to Rank
☆12 · Sep 11, 2018 · Updated 7 years ago
zxie / vae
Variational autoencoder in Theano
☆11 · Sep 14, 2017 · Updated 8 years ago
HaoyueBaiZJU / NAS-OoD
☆12 · Nov 18, 2022 · Updated 3 years ago
VincentShenbw / similarityjoin
Implementation of many similarity join algorithms.
☆15 · Mar 6, 2014 · Updated 12 years ago
aiueola / wsdm2022-cascade-dr
(WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"
☆13 · Jul 16, 2023 · Updated 3 years ago
rgmyr / tf-ProSeNet
A TensorFlow [2.0] implementation of ProSeNet: "Interpretable and Steerable Sequence Learning via Prototypes" (Ming et al., 2019)
☆13 · Dec 19, 2019 · Updated 6 years ago
tansey / tstd0
An experiment with Thompson sampling and TD(0) on a grid world variant
☆17 · Nov 8, 2013 · Updated 12 years ago
ajaech / query_completion
Personalized Query Completion
☆27 · Nov 21, 2020 · Updated 5 years ago
wapc / wapc-guest-zig
SDK for creating waPC WebAssembly Guest Modules in Zig
☆14 · Dec 27, 2021 · Updated 4 years ago
oppsitre / RLift
Reinforcement Learning for Uplift Modeling
☆13 · Mar 13, 2021 · Updated 5 years ago
ntucllab / striatum
Contextual bandit in python
☆112 · Jul 7, 2021 · Updated 5 years ago
Akella17 / Deep-Bayesian-Quadrature-Policy-Optimization
Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.
☆17 · Feb 17, 2021 · Updated 5 years ago
dhpark22 / collranking
☆13 · Nov 15, 2016 · Updated 9 years ago