colby-j-wise / ParticleThompsonSamplingMAB
Multi-Arm Bandits for online recommendations via Particle Thompson Sampling with Probabilistic Matrix Factorization
☆14Updated 6 years ago
Alternatives and similar repositories for ParticleThompsonSamplingMAB:
Users that are interested in ParticleThompsonSamplingMAB are comparing it to the libraries listed below
- Bootstrap (Linear) Thompson Sampling☆13Updated 8 years ago
- Implementing LinUCB and HybridLinUCB in Python.☆47Updated 6 years ago
- Predict and recommend the news articles, user is most likely to click in real time.☆30Updated 6 years ago
- Stream Data based News Recommendation - Contextual Bandit Approach☆48Updated 7 years ago
- paper list in the area of reinforcenment learning for recommendation systems☆24Updated 4 years ago
- Code for paper "On Sampling Strategies for Neural Network-based Collaborative Filtering"☆39Updated 7 years ago
- This is a paper list for recent studies on optimization algorithms.☆12Updated 6 years ago
- ☆17Updated 7 years ago
- A training and testing framework supporting experiments in CIKM 2016 paper "User Response Learning for Directly Optimizing Campaign Perfo…☆25Updated 6 years ago
- Implementation of AutoSVD++ (SIGIR 2017)☆15Updated 6 years ago
- A comparison of Google SlateQ algorithm with traditional Reinforcement Learning algorithms☆34Updated 2 years ago
- Session based Recommendation, RecSys, TensorFlow☆22Updated 6 years ago
- Toy implementation of SLIM and SSLIM Recommendation methods.☆41Updated 6 years ago
- (WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"☆13Updated last year
- Representation Learning and Pairwise Ranking for Implicit Feedback in Top-N Item Recommendation☆23Updated 7 years ago
- Linear UCB bandit learning algorithm L Li(2010) python code☆18Updated 10 years ago
- ☆11Updated 5 years ago
- Python code for the post "Adversarial Bandits and the Exp3 Algorithm"☆51Updated 4 years ago
- Hybrid Linear UCB bandit learning algorithm L Li(2010) python code☆56Updated 9 years ago
- code for ResSys'18 paper: "Exploring Recommendations Under User-Controlled Data Filtering"☆23Updated 6 years ago
- ☆18Updated 8 years ago
- A python implementation of Dueling Bandit Gradient Descent (DBGD)☆22Updated 6 years ago
- ☆52Updated 5 years ago
- A toolkit of Reinforcement Learning based Recommendation (RL4Rec)☆22Updated 2 years ago
- This is a new deep learning model for recommender system, which we called PHD☆32Updated 6 years ago
- Code for the experiments of Matrix Factorization Bandit☆24Updated 6 years ago
- Implemented SVD, SVD++ and timeSVD++. Can be used on the netflix data to make predictions. Data can be downloaded from https://minnow.noi…☆14Updated 9 years ago
- keras 2.1.4 / tensorflow 1.7.0☆16Updated 6 years ago
- Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising☆26Updated 4 years ago
- ☆16Updated 4 years ago