adith387 / slates_semisynth_expts
Semi-synthetic experiments to test several approaches for off-policy evaluation and optimization of slate recommenders.
☆43 Updated 7 years ago
Alternatives and similar repositories for slates_semisynth_expts
Users who are interested in slates_semisynth_expts are comparing it to the libraries listed below.
- Source code for our paper "Joint Policy-Value Learning for Recommendation" published at KDD 2020. ☆23 Updated 2 years ago
- Contextual bandit in python ☆114 Updated 4 years ago
- Non-stationary Off-policy Evaluation ☆13 Updated 6 years ago
- (RecSys2020) "Doubly Robust Estimator for Ranking Metrics with Post-Click Conversions" ☆24 Updated 2 years ago
- (WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model" ☆13 Updated 2 years ago
- ☆30 Updated 5 years ago
- Experimentation for oracle based contextual bandit algorithms. ☆32 Updated 2 years ago
- Implementation of provably Rawlsian fair ML algorithms for contextual bandits. ☆14 Updated 8 years ago
- Python code for the Linear UCB bandit learning algorithm (L. Li, 2010) ☆19 Updated 10 years ago
- Code for Policy Learning for Fairness in Ranking paper at NeurIPS 2019 ☆20 Updated 3 years ago
- Code for "Using Embeddings to Correct for Unobserved Confounding" ☆10 Updated 6 years ago
- ☆18 Updated 4 years ago
- ☆16 Updated 4 years ago
- Off-policy Learning in Two-stage Recommender Systems. https://dl.acm.org/doi/pdf/10.1145/3366423.3380130 ☆30 Updated 5 years ago
- Stream Data based News Recommendation - Contextual Bandit Approach ☆48 Updated 7 years ago
- ☆66 Updated 2 years ago
- ☆16 Updated 8 years ago
- ☆35 Updated 6 years ago
- (ICML2020) “Counterfactual Cross-Validation: Stable Model Selection Procedure for Causal Inference Models” ☆31 Updated 2 years ago
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances ☆12 Updated 2 years ago
- A comparison of Google SlateQ algorithm with traditional Reinforcement Learning algorithms ☆37 Updated 2 years ago
- Source code for our paper "Top-K Contextual Bandits with Equity of Exposure" published at RecSys 2021. ☆16 Updated 3 years ago
- ☆15 Updated 5 years ago
- ☆27 Updated 7 years ago
- Fair Benchmarks ☆10 Updated 6 years ago
- A Pytorch implementation of "Deep Learning with Logged Bandit Feedback" ☆10 Updated 6 years ago
- Implementation of Counterfactual risk minimization ☆26 Updated 8 years ago
- This is an implementation of the Dual Learning Algorithm with multi-layer feed-forward neural network for online unbiased learning to ran… ☆89 Updated 2 years ago
- working example of a contextual multi-armed bandit ☆55 Updated 5 years ago
- Exponential family embeddings (Poisson or Bernoulli) for discrete data ☆32 Updated 6 years ago