yanyangbaobeiIsEmma-zz / Reinforcement-Learning-Contextual-Bandits
☆11 Updated 6 years ago
Alternatives and similar repositories for Reinforcement-Learning-Contextual-Bandits:
Users who are interested in Reinforcement-Learning-Contextual-Bandits are comparing it to the libraries listed below:
- Implementation of Counterfactual risk minimization ☆26 Updated 7 years ago
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances ☆12 Updated 2 years ago
- (WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model" ☆13 Updated last year
- Non-stationary Off-policy Evaluation ☆13 Updated 6 years ago
- ☆16 Updated 4 years ago
- Tutorial for Multi-Stakeholder Recommender Systems ☆22 Updated 3 years ago
- ☆10 Updated 4 years ago
- Starter kit for getting started in the NIPS 2017 Criteo Ad Placement Challenge ☆19 Updated 7 years ago
- Fair Benchmarks ☆10 Updated 6 years ago
- Semi-synthetic experiments to test several approaches for off-policy evaluation and optimization of slate recommenders. ☆43 Updated 7 years ago
- Estimators to perform off-policy evaluation ☆13 Updated 6 months ago
- Source code for our paper "Joint Policy-Value Learning for Recommendation" published at KDD 2020. ☆22 Updated last year
- ☆29 Updated 6 years ago
- Code for Quantifying Ignorance in Individual-Level Causal-Effect Estimates under Hidden Confounding ☆21 Updated 2 years ago
- This repository contains the code used to generate the data splits, run the hyperparameter tuning, and export the results presented … ☆12 Updated 2 years ago
- Code for "Using Embeddings to Correct for Unobserved Confounding" ☆10 Updated 5 years ago
- Code for the paper "Optimal Off-Policy Evaluation from Multiple Logging Policies" ☆15 Updated 3 years ago
- Code for the RecSys20 paper -- Unbiased Implicit Recommendation and Propensity Estimation via Combinational Joint Learning ☆10 Updated 4 years ago
- Software relating to relational empirical risk minimization ☆17 Updated 3 years ago
- Offline evaluation of multi-armed bandit algorithms ☆22 Updated 4 years ago
- Source code for our paper "Pessimistic Decision-Making for Recommender Systems" published at ACM TORS, and RecSys 2021. ☆10 Updated 2 years ago
- (ICTIR2020) "Unbiased Pairwise Learning from Biased Implicit Feedback" ☆18 Updated 2 years ago
- Determinantal Point Processes in Julia ☆12 Updated 5 years ago
- ☆15 Updated 5 years ago
- ☆30 Updated 4 years ago
- gFM: An Efficient Solver for generalized Factorization Machine ☆8 Updated 8 years ago
- Experiments in Bayesian Machine Learning ☆69 Updated 5 years ago
- A Pytorch implementation of "Deep Learning with Logged Bandit Feedback" ☆10 Updated 6 years ago
- ☆42 Updated 6 years ago
- Implementation of importance sampling, direct, and hybrid methods for off-policy evaluation. ☆14 Updated 4 years ago