sgiguere / RobinHood-NeurIPS-2019
Implementation of safe offline bandit algorithms.
☆9Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for RobinHood-NeurIPS-2019
- Implementation of provably Rawlsian fair ML algorithms for contextual bandits.☆14Updated 7 years ago
- Non-stationary Off-policy Evaluation☆13Updated 6 years ago
- (WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"☆13Updated last year
- Estimators to perform off-policy evaluation☆13Updated 2 months ago
- Accelerated Confergence for Counterfactual Learning to Rank☆16Updated 2 years ago
- Source code for our paper "Joint Policy-Value Learning for Recommendation" published at KDD 2020.☆22Updated last year
- Empirical Likelihood for Contextual Bandits☆12Updated 4 years ago
- Code for the paper "Optimal Off-Policy Evaluation from Multiple Logging Policies"☆15Updated 3 years ago
- Implementation of Counterfactual risk minimization☆26Updated 7 years ago
- Fair Benchmarks☆10Updated 5 years ago
- ☆11Updated 6 years ago
- Birkhoff decomposition for doubly stochastic matrices.☆15Updated last year
- ☆16Updated 4 years ago
- This repository contains the scripts used during my participation on CIKM Cup 2016 (see http://cikmcup.org/ and https://competitions.coda…☆11Updated 8 years ago
- ☆14Updated 3 years ago
- (ICML2022) Off-Policy Evaluation for Large Action Spaces via Embeddings☆20Updated 2 years ago
- Python code for implementing embeddings in the Wasserstein space of elliptical distributions☆10Updated 4 years ago
- ☆21Updated 3 years ago
- Tools for robustness evaluation in interpretability methods☆11Updated 3 years ago
- Implementation of importance sampling, direct, and hybrid methods for off-policy evaluation.☆14Updated 4 years ago
- Code for "Boosted Generative Models", AAAI 2018.☆20Updated 6 years ago
- Batch IS NOT Heavy: Learning Word Representations From All Samples☆10Updated 6 years ago
- Code for the RecSys20 paper -- Unbiased Implicit Recommendation and Propensity Estimation via Combinational Joint Learning☆10Updated 4 years ago
- TF-Tile: an efficient sparse representation for real-valued data☆13Updated last year
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆11Updated 3 years ago
- Semi-synthetic experiments to test several approaches for off-policy evaluation and optimization of slate recommenders.☆43Updated 7 years ago
- ☆10Updated 4 years ago
- Source code for our paper "Top-K Contextual Bandits with Equity of Exposure" published at RecSys 2021.☆16Updated 3 years ago
- Code for paper by Bamler & Mandt, "Extreme Classification via Adversarial Softmax Approximation" (ICLR 2020)☆14Updated 4 years ago
- ☆11Updated 2 years ago