usaito / icml2022-mips
(ICML2022) Off-Policy Evaluation for Large Action Spaces via Embeddings
☆20 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for icml2022-mips
- (WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model" · ☆13 · Updated last year
- Deconfounding Reinforcement Learning in Observational Settings · ☆48 · Updated 5 years ago
- Code for "Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models" (ICML 2019) · ☆42 · Updated 4 years ago
- Implementation of the paper "Long-Term Effect Estimation with Surrogate Representation" · ☆12 · Updated 4 years ago
- (SIGIR2020) "Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback" · ☆22 · Updated last year
- ☆17 · Updated 4 years ago
- (RecSys2020) "Doubly Robust Estimator for Ranking Metrics with Post-Click Conversions" · ☆23 · Updated last year
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances · ☆12 · Updated 2 years ago
- ☆12 · Updated 2 years ago
- ☆37 · Updated 5 years ago
- A TensorFlow implementation of SOFA, the Simulator for OFfline LeArning and evaluation. · ☆20 · Updated 3 years ago
- Causal Effect Inference for Structured Treatments (SIN) (NeurIPS 2021) · ☆41 · Updated 2 years ago
- ☆30 · Updated 4 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments" · ☆17 · Updated 4 years ago
- Code for the RecSys20 paper "Unbiased Implicit Recommendation and Propensity Estimation via Combinational Joint Learning" · ☆10 · Updated 4 years ago
- Code for the NeurIPS 2019 paper "Policy Learning for Fairness in Ranking" · ☆20 · Updated 2 years ago
- ☆42 · Updated 2 years ago
- Source code for our paper "Top-K Contextual Bandits with Equity of Exposure" published at RecSys 2021. · ☆16 · Updated 3 years ago
- ☆15 · Updated last year
- Implementation of variational autoencoders for collaborative filtering in PyTorch · ☆24 · Updated 5 years ago
- Source code for our paper "Pessimistic Decision-Making for Recommender Systems" published at ACM TORS, and RecSys 2021. · ☆11 · Updated last year
- WIP implementation of https://arxiv.org/pdf/1901.08162.pdf · ☆9 · Updated 4 years ago
- Code for the paper "Optimal Off-Policy Evaluation from Multiple Logging Policies" · ☆15 · Updated 3 years ago
- A toolkit of Reinforcement Learning based Recommendation (RL4Rec) · ☆20 · Updated 2 years ago
- Reimplementation of NOTEARS in Tensorflow · ☆32 · Updated last year
- [WSDM '22] On Sampling Collaborative Filtering Datasets · ☆20 · Updated 2 years ago
- Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020. · ☆20 · Updated 3 years ago
- (WSDM2020) "Unbiased Recommender Learning from Missing-Not-At-Random Implicit Feedback" · ☆30 · Updated last year
- ☆65 · Updated 3 months ago
- Non-stationary Off-policy Evaluation · ☆13 · Updated 6 years ago