VowpalWabbit / estimators
Estimators to perform off-policy evaluation
☆13Updated 8 months ago
Alternatives and similar repositories for estimators:
Users that are interested in estimators are comparing it to the libraries listed below
- Empirical Likelihood for Contextual Bandits☆12Updated 4 years ago
- Non-stationary Off-policy Evaluation☆13Updated 6 years ago
- Source code for our paper "Joint Policy-Value Learning for Recommendation" published at KDD 2020.☆22Updated last year
- Materials for the "Recommender Systems through the lens of Decision Theory" tutorial delivered at the 30th Web Conference (WWW '21).☆11Updated 4 years ago
- Online Ranking with Multi-Armed-Bandits☆18Updated 3 years ago
- Implementation of Counterfactual risk minimization☆26Updated 8 years ago
- Contextual bandit benchmarking☆49Updated 8 months ago
- (WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"☆13Updated last year
- Accelerated Confergence for Counterfactual Learning to Rank☆17Updated 3 years ago
- Exponential family embeddings (Poisson or Bernoulli) for discrete data☆32Updated 5 years ago
- gFM: An Efficient Solver for generalized Factorization Machine☆8Updated 8 years ago
- Semi-synthetic experiments to test several approaches for off-policy evaluation and optimization of slate recommenders.☆43Updated 7 years ago
- Source code for our paper "Pessimistic Decision-Making for Recommender Systems" published at ACM TORS, and RecSys 2021.☆10Updated 2 years ago
- Python code for implementing embeddings in the Wasserstein space of elliptical distributions☆11Updated 4 years ago
- ☆30Updated 4 years ago
- Implementation of provably Rawlsian fair ML algorithms for contextual bandits.☆14Updated 7 years ago
- Code for "Using Embeddings to Correct for Unobserved Confounding"☆10Updated 5 years ago
- ☆14Updated 4 years ago
- Experimentation for oracle based contextual bandit algorithms.☆31Updated 2 years ago
- Pyro models and misc examples.☆19Updated 3 years ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Updated 4 years ago
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances☆12Updated 2 years ago
- TF-Tile: an efficient sparse representation for real-valued data☆14Updated 2 years ago
- Fair Benchmarks☆10Updated 6 years ago
- Birkhoff decomposition for doubly stochastic matrices.☆15Updated last year
- 🧮 Extended Latent Dirichlet Allocation for Collaborative Filtering in Recommender Systems.☆41Updated 2 years ago
- A reliable leaderboard algorithm for machine learning competitions☆17Updated 9 years ago
- Scikit-learn compatible implementations of the Random Rotation Ensemble idea of (Blaser & Fryzlewicz, 2016)☆43Updated 9 years ago
- (RecSys2020) "Doubly Robust Estimator for Ranking Metrics with Post-Click Conversions"☆24Updated 2 years ago
- Simple ranking metrics for PyTorch on CPU or GPU☆15Updated 4 years ago