wangbx66 / contextual-banditLinks

Contextual Combinatorial Cascading Bandits

☆10

Alternatives and similar repositories for contextual-bandit

Users that are interested in contextual-bandit are comparing it to the libraries listed below

Sorting:

HCDM / BanditLib
Library of contextual bandits algorithms
☆333Updated last year
lilianweng / multi-armed-bandit
Play with the solutions to the multi-armed-bandit problem.
☆415Updated last year
ntucllab / striatum
Contextual bandit in python
☆114Updated 4 years ago
SoluMilken / Contextual-Bandit
Contextual Bandit Algorithms (+Bandit Algorithms)
☆22Updated 5 years ago
bgalbraith / bandits
Python library for Multi-Armed Bandits
☆751Updated 5 years ago
niffler92 / Bandit
Bandit algorithms
☆30Updated 7 years ago
saisrivatsan / deep-opt-auctions
Implementation of Optimal Auctions through Deep Learning
☆128Updated 5 years ago
sauxpa / neural_exploration
Study NeuralUCB and regret analysis for contextual bandit with neural decision
☆95Updated 3 years ago
SMPyBandits / SMPyBandits
🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorith…
☆407Updated last year
iosband / ts_tutorial
☆362Updated 4 years ago
KKeishiro / Yahoo_recommendation
Yahoo! news article recommendation system by linUCB
☆112Updated 7 years ago
uclaml / NeuralUCB
☆35Updated 5 years ago
Hanjun-Dai / graph_comb_opt
Implementation of "Learning Combinatorial Optimization Algorithms over Graphs"
☆505Updated 6 years ago
Hanjun-Dai / graphnn
Training computational graph on top of structured data (string, graph, etc)
☆289Updated 4 years ago
higgsfield / np-hard-deep-reinforcement-learning
pytorch neural combinatorial optimization
☆385Updated 7 years ago
devsisters / pointer-network-tensorflow
TensorFlow implementation of "Pointer Networks"
☆475Updated 8 years ago
annieyan / Bandits-using-UCB-algorithm
Thompson Sampling for Bandits using UCB policy
☆10Updated 7 years ago
jimkon / Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym
☆177Updated 7 years ago
han-cai / rlb-dp
Real-Time Bidding by Reinforcement Learning in Display Advertising
☆183Updated 4 years ago
criteo-research / reco-gym
Code for reco-gym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising
☆475Updated 4 years ago
pemami4911 / neural-combinatorial-rl-pytorch
PyTorch implementation of Neural Combinatorial Optimization with Reinforcement Learning https://arxiv.org/abs/1611.09940
☆583Updated 7 years ago
timnugent / bandit-algorithms
Epsilon-greedy, softmax and LinUCB contextual bandit implementations [recommender systems]
☆50Updated 6 years ago
Networks-Learning / netrate
Netrate Matlab/CVX
☆10Updated 12 years ago
dawenl / expo-mf
Exposure Matrix Factorization: modeling user exposure in recommendation
☆96Updated 9 years ago
Hanjun-Dai / pytorch_structure2vec
pytorch implementation of structure2vec (https://arxiv.org/abs/1603.05629)
☆311Updated 6 years ago
chenhaokun / TPGR
python implementation of the TPGR
☆39Updated 6 years ago
JianGuanTHU / IRecGAN
Implementation for our paper in NeurIPS 2019
☆48Updated 5 years ago
ymy4323460 / HATCH
☆38Updated 3 years ago
guyulongcs / Awesome-Deep-Reinforcement-Learning-Papers-for-Search-Recommendation-Advertising
Awesome Deep Reinforcement Learning papers for industrial Search, Recommendation and Advertising.
☆206Updated 4 years ago
Alanthink / banditpylib
A lightweight python library for bandit algorithms
☆30Updated 2 years ago