akhadangi / Multi-armed-Bandits

This notebook implements several classes of multi-armed bandit algorithms, including epsilon-greedy, UCB, Linear UCB (contextual bandits), and Kernel UCB. Algorithms from some well-cited papers in this area are also implemented.

☆89

Alternatives and similar repositories for Multi-armed-Bandits

Users who are interested in Multi-armed-Bandits are comparing it to the libraries listed below.

sauxpa / neural_exploration
Study NeuralUCB and regret analysis for contextual bandit with neural decision
☆99 Updated 4 years ago
alison-carrera / mabalgs
Multi-Armed Bandit Algorithms Library (MAB)
☆133 Updated 3 years ago
banditml / banditml
A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.
☆70 Updated 4 years ago
andrecianflone / thompson
Thompson Sampling Tutorial
☆55 Updated 6 years ago
uclaml / NeuralUCB
☆48 Updated 5 years ago
lilianweng / multi-armed-bandit
Play with the solutions to the multi-armed-bandit problem.
☆415 Updated last year
Lunj12 / RL-Bandits-with-Knapsacks
Dynamic Pricing BwK Problem and Reinforcement Learning
☆31 Updated 7 years ago
henryslzhao / RL4Recsys
Paper list in the area of reinforcement learning for recommendation systems
☆25 Updated 5 years ago
orrivlin / MinimumVertexCover_DRL
Learning to solve Minimum Vertex Cover using Graph Convolutional Networks and RL
☆77 Updated 6 years ago
antonismand / Personalized-News-Recommendation
Multi Armed Bandits implementation using the Yahoo! Front Page Today Module User Click Log Dataset
☆99 Updated 4 years ago
kfoofw / bandit_simulations
Bandit algorithms simulations for online learning
☆89 Updated 5 years ago
saisrivatsan / deep-opt-auctions
Implementation of Optimal Auctions through Deep Learning
☆134 Updated 6 years ago
BartyzalRadek / contextual-bandits-recommender
Implementing LinUCB and HybridLinUCB in Python.
☆49 Updated 7 years ago
sadighian / recommendation-gym
MovieLens recommendation system using reinforcement learning (GYM + PPO)
☆50 Updated 5 years ago
mktal / kddcup-starting-kit
The submission template for the Learning to Dispatch and Reposition Competition @ KDD2020.
☆92 Updated 4 years ago
SahanaRamnath / MultiArmedBandit_RL
Implementation of various multi-armed bandits algorithms on a 10-arm testbed.
☆38 Updated 5 years ago
HCDM / BanditLib
Library of contextual bandits algorithms
☆336 Updated last year
jimkon / Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym
☆178 Updated 7 years ago
YRussac / WeightedLinearBandits
Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"
☆17 Updated 6 years ago
XueyingBai / Model-Based-Reinforcement-Learning-for-Online-Recommendation
A PyTorch implementation of "A Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation".
☆40 Updated 6 years ago
meowoodie / Reinforcement-Learning-of-Spatio-Temporal-Point-Processes
A general framework for learning spatio-temporal point processes via reinforcement learning
☆30 Updated 4 years ago
guyulongcs / Awesome-Deep-Reinforcement-Learning-Papers-for-Search-Recommendation-Advertising
Awesome Deep Reinforcement Learning papers for industrial Search, Recommendation and Advertising.
☆217 Updated 4 years ago
JianGuanTHU / IRecGAN
Implementation for our paper in NeurIPS 2019
☆48 Updated 6 years ago
swisscom / ai-research-mamo-framework
A Model Agnostic Multi-Objective Framework for Deep Learning models
☆32 Updated 5 years ago
backgom2357 / Recommender_system_via_deep_RL
The implementation of Deep Reinforcement Learning based Recommender System from the paper Deep Reinforcement Learning based Recommendation…
☆123 Updated 2 years ago
Jinjiarui / rl4rs-papers
A collection of research and survey papers on reinforcement learning (RL) based recommender system techniques.
☆72 Updated 5 years ago
isl-org / NPHard
Combinatorial Optimization with Graph Convolutional Networks and Guided Tree Search
☆155 Updated last year
criteo-research / optimization-continuous-action-crm
☆30 Updated 5 years ago
xinshi-chen / GenerativeAdversarialUserModel
TensorFlow implementation for "Generative Adversarial User Model for Reinforcement Learning Based Recommendation System"
☆131 Updated 6 years ago
SMPyBandits / SMPyBandits
🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorith…
☆416 Updated last year