jonnedtc / Multi-Armed-Bandits
Bootstrap (Linear) Thompson Sampling
☆13Updated 8 years ago
Related projects: ⓘ
- Multi-Arm Bandits for online recommendations via Particle Thompson Sampling with Probabilistic Matrix Factorization☆14Updated 6 years ago
- This is a paper list for recent studies on optimization algorithms.☆12Updated 6 years ago
- A set of RL experiments. Currently including: (1) the MDP rank experiment, based on policy gradient algorithm☆27Updated 2 years ago
- ☆11Updated 5 years ago
- A training and testing framework supporting experiments in CIKM 2016 paper "User Response Learning for Directly Optimizing Campaign Perfo…☆25Updated 6 years ago
- LambdaFM: Learning Optimal Ranking with Factorization Machines Using Lambda Surrogates☆18Updated 5 years ago
- Experimental study on team formation problem.☆7Updated 9 years ago
- Source code for our paper "Joint Policy-Value Learning for Recommendation" published at KDD 2020.☆22Updated last year
- Code for paper "On Sampling Strategies for Neural Network-based Collaborative Filtering"☆40Updated 6 years ago
- Linear Recommender AAAI-16 code☆20Updated 6 years ago
- ☆16Updated 7 years ago
- Predict and recommend the news articles, user is most likely to click in real time.☆30Updated 6 years ago
- Stream Data based News Recommendation - Contextual Bandit Approach☆49Updated 6 years ago
- Toy implementation of SLIM and SSLIM Recommendation methods.☆41Updated 6 years ago
- Deep Learning for Recommendation☆35Updated 7 years ago
- ☆11Updated 5 years ago
- ☆16Updated 3 years ago
- items browsed in a session as a context are modeled to vec with bidirectional lstm☆18Updated 7 years ago
- (WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"☆13Updated last year
- Matlab Implementation of the Local Collective Embeddings model☆13Updated 10 years ago
- This repository contains the scripts used during my participation on CIKM Cup 2016 (see http://cikmcup.org/ and https://competitions.coda…☆11Updated 7 years ago
- Hybrid Linear UCB bandit learning algorithm L Li(2010) python code☆56Updated 8 years ago
- Off-policy Learning in Two-stage Recommender Systems. https://dl.acm.org/doi/pdf/10.1145/3366423.3380130☆27Updated 4 years ago
- ☆10Updated 2 years ago
- Software for the experiments reported in the RecSys 2019 paper "A Simple Multi-Armed Nearest-Neighbor Bandit for Interactive Recommendati…☆21Updated 5 months ago
- ☆13Updated 8 years ago
- A python implementation of Dueling Bandit Gradient Descent (DBGD)☆21Updated 5 years ago
- A PyTorch implementation of REINFORCE Learning To Rank on OSHUMED, MQ, etc. dataset. Basic idea also appears in SIGIR'17 Reinforcement Le…☆18Updated 6 years ago
- ☆20Updated this week
- ☆17Updated 7 years ago