RonyAbecidan / Neural-Thompson-SamplingLinks
Study of the paper 'Neural Thompson Sampling' published in October 2020
☆22Updated 2 years ago
Alternatives and similar repositories for Neural-Thompson-Sampling
Users that are interested in Neural-Thompson-Sampling are comparing it to the libraries listed below
Sorting:
- Thompson Sampling Tutorial☆53Updated 6 years ago
- Multi-Armed Bandit algorithms applied to the MovieLens 20M dataset☆56Updated 4 years ago
- Bandit algorithms simulations for online learning☆88Updated 5 years ago
- Big Data's open seminars: An Interactive Introduction to Reinforcement Learning☆64Updated 4 years ago
- ☆12Updated 2 years ago
- A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.☆66Updated 4 years ago
- Customizable RecSys Simulator for OpenAI Gym☆26Updated 3 years ago
- In this notebook several classes of multi-armed bandits are implemented. This includes epsilon greedy, UCB, Linear UCB (Contextual bandit…☆86Updated 4 years ago
- ☆17Updated 3 years ago
- Distributed Machine Learning with Python, published by Packt☆41Updated last year
- Bandit algorithms for dynamic pricing of many products☆42Updated 5 years ago
- Package for building Market Segmentation Trees, Choice Model Trees, and Isotonic Regression Trees☆17Updated 2 years ago
- Source code for our paper "Pessimistic Decision-Making for Recommender Systems" published at ACM TORS, and RecSys 2021.☆11Updated 2 years ago
- A toolkit of Reinforcement Learning based Recommendation (RL4Rec)☆23Updated 3 years ago
- Tutorial on Multi-Objective Recommender Systems @ KDD 2021☆19Updated 2 years ago
- Multi Armed Bandits implementation using the Yahoo! Front Page Today Module User Click Log Dataset☆101Updated 3 years ago
- Offline evaluation of multi-armed bandit algorithms☆23Updated 4 years ago
- Uplift Modeling for Multiple Treatments☆16Updated 3 years ago
- (RecSys2020) "Doubly Robust Estimator for Ranking Metrics with Post-Click Conversions"☆24Updated 2 years ago
- ☆15Updated 5 years ago
- Dynamic Pricing BwK Problem and Reinforcement Learning☆31Updated 6 years ago
- Uplifted Contextual Multi-Armed Bandit☆19Updated 3 years ago
- Code for Conformal Counterfactual Inference under Hidden Confounding (KDD’24)☆11Updated 10 months ago
- Materials for "RL for Inventory Optimization", Day 4 of the "RL for Operations Bootcamp", Kellogg School of Management, Northwestern Univ…☆16Updated last year
- Active Bayesian Causal Inference (Neurips'22)☆58Updated 11 months ago
- A python implementation of Dueling Bandit Gradient Descent (DBGD)☆24Updated 6 years ago
- ☆51Updated last year
- Online Ranking with Multi-Armed-Bandits☆18Updated 3 years ago
- ☆18Updated 4 years ago
- The code repository for "To Copy, or not to Copy; That is a Critical Issue of the Output Softmax Layer in Neural Sequential Recommenders"☆9Updated last year