adith387 / slates_semisynth_expts
Semi-synthetic experiments to test several approaches for off-policy evaluation and optimization of slate recommenders.
☆43 Updated 8 years ago
Alternatives and similar repositories for slates_semisynth_expts
Users who are interested in slates_semisynth_expts are comparing it to the libraries listed below.
- Source code for our paper "Joint Policy-Value Learning for Recommendation" published at KDD 2020. ☆23 Updated 2 years ago
- Contextual bandit in python ☆112 Updated 4 years ago
- (WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model" ☆13 Updated 2 years ago
- Linear UCB bandit learning algorithm L Li(2010) python code ☆19 Updated 11 years ago
- ☆30 Updated 5 years ago
- (RecSys2020) "Doubly Robust Estimator for Ranking Metrics with Post-Click Conversions" ☆24 Updated 2 years ago
- ☆35 Updated 7 years ago
- This is an implementation of the Dual Learning Algorithm with multi-layer feed-forward neural network for online unbiased learning to ran… ☆89 Updated 2 years ago
- Non-stationary Off-policy Evaluation ☆13 Updated 7 years ago
- Contextual Bandit Algorithms (+Bandit Algorithms) ☆22 Updated 6 years ago
- ☆16 Updated 8 years ago
- Experimentation for oracle based contextual bandit algorithms. ☆33 Updated 3 years ago
- ☆68 Updated 2 years ago
- Code for "Using Embeddings to Correct for Unobserved Confounding" ☆10 Updated 6 years ago
- Off-policy Learning in Two-stage Recommender Systems. https://dl.acm.org/doi/pdf/10.1145/3366423.3380130 ☆30 Updated 5 years ago
- Code for Policy Learning for Fairness in Ranking paper at NeurIPS 2019 ☆20 Updated 3 years ago
- A comparison of Google SlateQ algorithm with traditional Reinforcement Learning algorithms ☆38 Updated 2 years ago
- Exposure Matrix Factorization: modeling user exposure in recommendation ☆96 Updated 9 years ago
- Toy implementation of SLIM and SSLIM Recommendation methods. ☆42 Updated 7 years ago
- Implementation of Counterfactual risk minimization ☆26 Updated 8 years ago
- ☆87 Updated 5 years ago
- Code for RecSys'19 paper: Leveraging Post-click Feedback for Content Recommendations ☆15 Updated 4 years ago
- Neural models for Collaborative Filtering ☆127 Updated 6 years ago
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances ☆12 Updated 3 years ago
- ☆16 Updated 5 years ago
- Accompanying code for reproducing experiments from the HybridSVD paper. Preprint is available at https://arxiv.org/abs/1802.06398. ☆24 Updated 6 years ago
- Starter kit for getting started in the NIPS 2017 Criteo Ad Placement Challenge ☆18 Updated 8 years ago
- Factorization Machine for regression and classification ☆98 Updated 8 years ago
- ☆20 Updated 5 years ago
- working example of a contextual multi-armed bandit ☆55 Updated 6 years ago