yanyangbaobeiIsEmma-zz / Reinforcement-Learning-Contextual-BanditsLinks

☆11

Alternatives and similar repositories for Reinforcement-Learning-Contextual-Bandits

Users that are interested in Reinforcement-Learning-Contextual-Bandits are comparing it to the libraries listed below

Sorting:

dferendo / RecommendationSystem
☆10Updated 4 years ago
aiueola / wsdm2022-cascade-dr
(WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"
☆13Updated 2 years ago
rjagerman / wsdm2019-nonstationary
Non-stationary Off-policy Evaluation
☆13Updated 6 years ago
jtcho / FairMachineLearning
Implementation of provably Rawlsian fair ML algorithms for contextual bandits.
☆14Updated 8 years ago
svrijenhoek / RADio
☆11Updated last year
sgiguere / RobinHood-NeurIPS-2019
Implementation of safe offline bandit algorithms.
☆10Updated 5 years ago
criteo-research / tf-tile
TF-Tile: an efficient sparse representation for real-valued data
☆14Updated 2 years ago
nd7141 / recsystutorial
☆16Updated 4 years ago
RobRomijnders / bandit
Implementation of Counterfactual risk minimization
☆26Updated 8 years ago
adith387 / slates_semisynth_expts
Semi-synthetic experiments to test several approaches for off-policy evaluation and optimization of slate recommenders.
☆43Updated 7 years ago
mesuvash / TFMF
Biased matrix factorisation using TensorFlow
☆19Updated 9 years ago
MingLin-home / gFM
gFM: An Efficient Solver for generalized Factorization Machine
☆8Updated 8 years ago
ContentWise / contentwise-impressions
This repository contains the code used to run generate the data splits, run the hyperparameter tunings, and export the results presented …
☆12Updated 2 years ago
frederickayala / session-based-recsys
Tutorials on session-based recommender systems
☆11Updated 8 years ago
spotify-research / RIPS_KDD2020
☆18Updated 4 years ago
olivierjeunen / dual-bandit-kdd-2020
Source code for our paper "Joint Policy-Value Learning for Recommendation" published at KDD 2020.
☆23Updated 2 years ago
Zziwei / Unbiased-Propensity-and-Recommendation
Code for the RecSys20 paper -- Unbiased Implicit Recommendation and Propensity Estimation via Combinational Joint Learning
☆10Updated 4 years ago
counterfactual-ml / kdd2022-tutorial
Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances
☆12Updated 2 years ago
antoine-hochart / bandit_algo_evaluation
Offline evaluation of multi-armed bandit algorithms
☆23Updated 4 years ago
vruvora / reinforcement-learning-kdd
☆42Updated 6 years ago
criteo-research / optimization-continuous-action-crm
☆30Updated 5 years ago
irecsys / Tutorial_MSRS
Tutorial for Multi-Stakeholder Recommender Systems
☆22Updated 3 years ago
vub-dl / u-cmab
Uplifted Contextual Multi-Armed Bandit
☆19Updated 3 years ago
rutgerswiselab / PAP-REC
☆11Updated last year
MilkaLichtblau / BA_Laura
Fair Benchmarks
☆10Updated 6 years ago
IlyaTrofimov / bb2vec
This is the source code of the paper "Inferring Complementary Products from Baskets and Browsing Sessions"
☆11Updated 6 years ago
alshedivat / DeterminantalPointProcesses.jl
Determinantal Point Processes in Julia
☆12Updated 5 years ago
moorecsys / moorecsys.github.io
Tutorial on Multi-Objective Recommender Systems @ KDD 2021
☆19Updated 2 years ago
alexbeutel / FlexiFaCT
Run large scale tensor and coupled matrix-tensor factorization on top of stock Hadoop.
☆18Updated 7 years ago
hyz20 / D2Co
Uncovering User Interest from Biased and Noised Watch Time in Video Recommendation. In Recsys23.
☆10Updated last year