j2kun / exp3Links

Python code for the post "Adversarial Bandits and the Exp3 Algorithm"

☆51

Alternatives and similar repositories for exp3

Users that are interested in exp3 are comparing it to the libraries listed below

Sorting:

rayshi14 / HybridLinUCB-python
Hybrid Linear UCB bandit learning algorithm L Li(2010) python code
☆56Updated 9 years ago
qingyun-wu / CoLinUCB_Revised
☆84Updated 6 years ago
ntucllab / striatum
Contextual bandit in python
☆114Updated 4 years ago
timnugent / bandit-algorithms
Epsilon-greedy, softmax and LinUCB contextual bandit implementations [recommender systems]
☆50Updated 6 years ago
henryslzhao / RL4Recsys
paper list in the area of reinforcenment learning for recommendation systems
☆24Updated 4 years ago
Jinjiarui / rl4rs-papers
A collection of research and survey papers of reforcement learning (RL) based recommender system techniques.
☆72Updated 5 years ago
ruhan / toyslim
Toy implementation of SLIM and SSLIM Recommendation methods.
☆42Updated 7 years ago
BartyzalRadek / contextual-bandits-recommender
Implementing LinUCB and HybridLinUCB in Python.
☆50Updated 7 years ago
dongx-duan / bpr
☆42Updated 9 years ago
ymy4323460 / HATCH
☆38Updated 3 years ago
HCDM / BanditLib
Library of contextual bandits algorithms
☆333Updated last year
j-wang / BanditEmpirical
Empirical tests of various bandit algorithms.
☆16Updated 10 years ago
tjuHaoXiaotian / ICML-2020-MSBCB
Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
☆26Updated 4 years ago
ustcljb / topK-off-policy-correction-REINFORCE
☆18Updated 4 years ago
chentingpc / NNCF
Code for paper "On Sampling Strategies for Neural Network-based Collaborative Filtering"
☆39Updated 7 years ago
insuhan / fastdppmap
☆18Updated 8 years ago
rk2900 / DLF
Deep learning for flexible market price modeling (landscape forecasting) in real-time bidding advertising. An implementation of our KDD 2…
☆72Updated 4 years ago
collinprather / SlateQ
A comparison of Google SlateQ algorithm with traditional Reinforcement Learning algorithms
☆37Updated 2 years ago
yihong-chen / lambda-opt
Pytorch implementation of λOpt: Learn to Regularize Recommender Models in Finer Levels, KDD 2019
☆53Updated 5 years ago
rayshi14 / LinearUCB-python
Linear UCB bandit learning algorithm L Li(2010) python code
☆19Updated 10 years ago
han-cai / rlb-dp
Real-Time Bidding by Reinforcement Learning in Display Advertising
☆183Updated 4 years ago
AaronJi / RL
A set of RL experiments. Currently including: (1) the MDP rank experiment, based on policy gradient algorithm
☆27Updated 3 years ago
KKeishiro / Yahoo_recommendation
Yahoo! news article recommendation system by linUCB
☆112Updated 7 years ago
QingyaoAi / Unbiased-Learning-to-Rank-with-Unbiased-Propensity-Estimation
This is an implementation of the Dual Learning Algorithm with multi-layer feed-forward neural network for online unbiased learning to ran…
☆89Updated 2 years ago
wnzhang / make-ipinyou-data
This project is to formalise the iPinYou RTB data into a standard format for further researches.
☆128Updated 2 years ago
massquantity / Ftrl-FFM
Field-aware factorization machine (FFM) with FTRL
☆46Updated last year
rec-agent / rec-rl
☆55Updated 5 years ago
zoulixin93 / FMCTS
☆11Updated 6 years ago
Atomu2014 / Ads-RecSys-Datasets
This repository collects some datasets for Ads & RecSys uses, and provide easy-to-use hdf5 iterative access.
☆90Updated 4 years ago
Atomu2014 / product-nets-distributed
distributed version of product-nets
☆82Updated 5 years ago