Public repository for the work on bandit problems
☆24Apr 4, 2024Updated 2 years ago
Alternatives and similar repositories for bandits
Users who are interested in bandits are comparing it to the libraries listed below.
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances☆12Aug 14, 2022Updated 3 years ago
- Implementation of multi-armed bandits in Julia☆12Jan 12, 2020Updated 6 years ago
- R package for Multi-Armed Bandit Simulation Study☆38Aug 18, 2017Updated 8 years ago
- ☆11Nov 24, 2021Updated 4 years ago
- Source code for our paper "Pessimistic Decision-Making for Recommender Systems" published at ACM TORS, and RecSys 2021.☆11Dec 15, 2022Updated 3 years ago
- My marketing projects applying AI☆10Oct 31, 2025Updated 7 months ago
- (WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"☆13Jul 16, 2023Updated 2 years ago
- This repository contains the scripts used during my participation on CIKM Cup 2016 (see http://cikmcup.org/ and https://competitions.coda…☆11Nov 4, 2016Updated 9 years ago
- Source code for our LBR paper "Closed-Form Models for Collaborative Filtering with Side-Information" published at RecSys 2020.☆15Jul 22, 2021Updated 4 years ago
- ☆15Sep 25, 2020Updated 5 years ago
- Source code for our paper "Top-K Contextual Bandits with Equity of Exposure" published at RecSys 2021.☆15Aug 2, 2021Updated 4 years ago
- Estimators to perform off-policy evaluation☆13Sep 3, 2024Updated last year
- A Pytorch implementation of "Deep Learning with Logged Bandit Feedback"☆10Aug 22, 2018Updated 7 years ago
- This repository contains the code used to generate the data splits, run the hyperparameter tunings, and export the results presented …☆14Jul 22, 2022Updated 3 years ago
- Accelerated Convergence for Counterfactual Learning to Rank☆17Jan 21, 2022Updated 4 years ago
- ☆16May 31, 2017Updated 9 years ago
- The implementation for our paper "Slate-Aware Ranking for Recommendation" accepted by WSDM'23☆16Dec 13, 2022Updated 3 years ago
- This is the implementation code for the WWW2021 paper "Variation Control and Evaluation for Generative Slate Recommendation"☆15Jun 7, 2021Updated 5 years ago
- Thompson Sampling for Bandits using UCB policy☆10Jul 29, 2017Updated 8 years ago
- ☆15Feb 25, 2021Updated 5 years ago
- ☆16May 9, 2022Updated 4 years ago
- (ICTIR2020) "Unbiased Pairwise Learning from Biased Implicit Feedback"☆19Nov 21, 2022Updated 3 years ago
- Empirical tests of various bandit algorithms.☆16Dec 6, 2014Updated 11 years ago
- ☆18Apr 25, 2023Updated 3 years ago
- Code for RecSys'19 paper: Leveraging Post-click Feedback for Content Recommendations☆15Jul 28, 2021Updated 4 years ago
- More about the exploration-exploitation tradeoff with harder bandits☆24May 12, 2019Updated 7 years ago
- Reading list for research topics in intent analysis.☆16Oct 23, 2023Updated 2 years ago
- Cold Start Similar Artists Ranking with Gravity-Inspired Graph Autoencoders (RecSys 2021)☆19Oct 17, 2021Updated 4 years ago
- A Python module for estimating divergence between two sets of samples.☆18Jul 6, 2023Updated 2 years ago
- Code for fitting neural spike trains with nonparametric hidden Markov and semi-Markov models built upon mattjj's PyHSMM framework.☆15Aug 10, 2018Updated 7 years ago
- Non-stationary Off-policy Evaluation☆13Nov 8, 2018Updated 7 years ago
- In-browser OCaml notebooks 🐪☆26Apr 9, 2019Updated 7 years ago
- Epsilon-greedy, softmax and LinUCB contextual bandit implementations [recommender systems]☆50Mar 15, 2019Updated 7 years ago
- scripts for evaluation of contextual bandit algorithms☆46Apr 27, 2020Updated 6 years ago
- Conditional density estimation with neural networks☆35Jan 18, 2025Updated last year
- Variational Bayesian Mixture of Factor Analysers☆24Jan 24, 2015Updated 11 years ago
- Vowpal Wabbit examples and tutorials☆21Jan 20, 2022Updated 4 years ago
- Tutorial for Multi-Stakeholder Recommender Systems☆22Aug 23, 2021Updated 4 years ago
- Code for the experiments of Matrix Factorization Bandit☆24Feb 4, 2019Updated 7 years ago