VowpalWabbit / coba
Contextual bandit benchmarking
☆50 Updated last month
Alternatives and similar repositories for coba
Users who are interested in coba are comparing it to the libraries listed below.
- Estimators to perform off-policy evaluation ☆13 Updated 10 months ago
- Implementation of Counterfactual risk minimization ☆26 Updated 8 years ago
- ☆27 Updated 7 years ago
- ☆30 Updated 5 years ago
- scripts for evaluation of contextual bandit algorithms ☆45 Updated 5 years ago
- Online Ranking with Multi-Armed-Bandits ☆18 Updated 3 years ago
- Semi-synthetic experiments to test several approaches for off-policy evaluation and optimization of slate recommenders. ☆43 Updated 7 years ago
- Data as Demonstrator (DaD) is a meta learning algorithm to improve the multi-step predictive capabilities of a learned time series (e.g. … ☆33 Updated 8 years ago
- Experimentation for oracle based contextual bandit algorithms. ☆32 Updated 2 years ago
- Empirical Likelihood for Contextual Bandits ☆12 Updated 4 years ago
- Library for Multi-Armed Bandit Algorithms ☆58 Updated 8 years ago
- ☆42 Updated 6 years ago
- Contextual bandit in python ☆114 Updated 4 years ago
- Starter kit for getting started in the NIPS 2017 Criteo Ad Placement Challenge ☆18 Updated 7 years ago
- A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services. ☆66 Updated 4 years ago
- An extension to Sacred for automated hyperparameter optimization. ☆59 Updated 7 years ago
- Reinforcement learning course at Data Science Retreat ☆42 Updated 6 years ago
- Scikit-learn compatible implementations of the Random Rotation Ensemble idea of (Blaser & Fryzlewicz, 2016) ☆43 Updated 9 years ago
- Scripts for paper "Encoding high-cardinality string categorical variables" ☆24 Updated 5 years ago
- Recommendation models that use binary rather than floating point operations at prediction time. ☆21 Updated 7 years ago
- Exponential family embeddings (Poisson or Bernoulli) for discrete data ☆32 Updated 6 years ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes. ☆12 Updated 4 years ago
- Code for "Using Embeddings to Correct for Unobserved Confounding" ☆10 Updated 6 years ago
- Non-stationary Off-policy Evaluation ☆13 Updated 6 years ago
- Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick ☆17 Updated 8 years ago
- Source code for our paper "Joint Policy-Value Learning for Recommendation" published at KDD 2020. ☆23 Updated 2 years ago
- Interaction-side integration library for Reinforcement Learning loops: Predict, Log, [Learn,] Update ☆75 Updated 8 months ago
- Implementation of provably Rawlsian fair ML algorithms for contextual bandits. ☆14 Updated 8 years ago
- An AutoML pipeline selection system to quickly select a promising pipeline for a new dataset. ☆83 Updated 3 years ago
- PyTorch port and extension of the Deep Bayesian Bandits Library ☆42 Updated 5 years ago