lilianweng / multi-armed-banditLinks

Play with the solutions to the multi-armed-bandit problem.

☆410

Alternatives and similar repositories for multi-armed-bandit

Users that are interested in multi-armed-bandit are comparing it to the libraries listed below

Sorting:

HCDM / BanditLib
Library of contextual bandits algorithms
☆333Updated last year
david-cortes / contextualbandits
Python implementations of contextual bandits algorithms
☆789Updated last week
bgalbraith / bandits
Python library for Multi-Armed Bandits
☆750Updated 5 years ago
criteo-research / reco-gym
Code for reco-gym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising
☆476Updated 3 years ago
SMPyBandits / SMPyBandits
🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorith…
☆407Updated last year
alison-carrera / mabalgs
Multi-Armed Bandit Algorithms Library (MAB)
☆133Updated 2 years ago
sauxpa / neural_exploration
Study NeuralUCB and regret analysis for contextual bandit with neural decision
☆95Updated 3 years ago
akhadangi / Multi-armed-Bandits
In this notebook several classes of multi-armed bandits are implemented. This includes epsilon greedy, UCB, Linear UCB (Contextual bandit…
☆87Updated 4 years ago
kfoofw / bandit_simulations
Bandit algorithms simulations for online learning
☆86Updated 5 years ago
ntucllab / striatum
Contextual bandit in python
☆114Updated 3 years ago
google-research / recsim
A Configurable Recommender Systems Simulation Platform
☆762Updated 3 years ago
jimkon / Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym
☆177Updated 7 years ago
johnmyleswhite / BanditsBook
Code for my book on Multi-Armed Bandit Algorithms
☆912Updated 5 years ago
guyulongcs / Awesome-Deep-Reinforcement-Learning-Papers-for-Search-Recommendation-Advertising
Awesome Deep Reinforcement Learning papers for industrial Search, Recommendation and Advertising.
☆206Updated 4 years ago
sfujim / BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
☆632Updated 4 years ago
cszhangzhen / DRL4Recsys
Courses on Deep Reinforcement Learning (DRL) and DRL papers for recommender systems
☆300Updated 2 years ago
KKeishiro / Yahoo_recommendation
Yahoo! news article recommendation system by linUCB
☆113Updated 7 years ago
banditml / banditml
A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.
☆67Updated 4 years ago
antonismand / Personalized-News-Recommendation
Multi Armed Bandits implementation using the Yahoo! Front Page Today Module User Click Log Dataset
☆101Updated 3 years ago
arowdy98 / Stanford-CS234
Assignment Solutions to CS234: Reinforcement learning course
☆36Updated 6 years ago
xuwd11 / cs294-112_hws
My solution to assignments in UC Berkeley CS294-112: Deep Reinforcement Learning
☆92Updated 6 years ago
Shmuma / ptan
PyTorch Agent Net: reinforcement learning toolkit for pytorch
☆546Updated 7 months ago
dalmia / David-Silver-Reinforcement-learning
Notes for the Reinforcement Learning course by David Silver along with implementation of various algorithms.
☆817Updated 3 years ago
SahanaRamnath / MultiArmedBandit_RL
Implementation of various multi-armed bandits algorithms on a 10-arm testbed.
☆38Updated 5 years ago
kulinshah98 / Multi-Armed-Bandit-Algorithms
Python implementation of UCB, EXP3 and Epsilon greedy algorithms
☆28Updated 6 years ago
dongminlee94 / deep_rl
PyTorch implementation of deep reinforcement learning algorithms
☆494Updated 3 years ago
JKCooper2 / rlai-exercises
Exercise Solutions for Reinforcement Learning: An Introduction [2nd Edition]
☆107Updated 2 years ago
RITCHIEHuang / DeepRL_Algorithms
DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DD…
☆334Updated 2 years ago
Huixxi / CS234-Reinforcement-Learning-Winter-2019
My Solutions of Assignments of CS234: Reinforcement Learning Winter 2019
☆169Updated 2 years ago
iosband / ts_tutorial
☆361Updated 4 years ago