allenday / contextual-bandit
working example of a contextual multi-armed bandit
☆55Updated 5 years ago
Alternatives and similar repositories for contextual-bandit:
Users that are interested in contextual-bandit are comparing it to the libraries listed below
- Contextual bandit in python☆111Updated 3 years ago
- Semi-synthetic experiments to test several approaches for off-policy evaluation and optimization of slate recommenders.☆43Updated 7 years ago
- Stream Data based News Recommendation - Contextual Bandit Approach☆48Updated 7 years ago
- Experimentation for oracle based contextual bandit algorithms.☆31Updated 2 years ago
- Predict and recommend the news articles, user is most likely to click in real time.☆30Updated 7 years ago
- Contextual Bandit Algorithms (+Bandit Algorithms)☆22Updated 5 years ago
- Linear UCB bandit learning algorithm L Li(2010) python code☆19Updated 10 years ago
- Implementation of Counterfactual risk minimization☆26Updated 8 years ago
- ☆103Updated 3 years ago
- ☆11Updated 6 years ago
- ☆40Updated 7 years ago
- Library of contextual bandits algorithms☆334Updated last year
- Bandit algorithms simulations for online learning☆84Updated 4 years ago
- Big Data's open seminars: An Interactive Introduction to Reinforcement Learning☆63Updated 3 years ago
- scripts for evaluation of contextual bandit algorithms☆45Updated 4 years ago
- Implementing LinUCB and HybridLinUCB in Python.☆47Updated 6 years ago
- ☆42Updated 6 years ago
- PyTorch port and extension of the Deep Bayesian Bandits Library☆42Updated 5 years ago
- ☆50Updated 4 years ago
- Starter kit for getting started in the NIPS 2017 Criteo Ad Placement Challenge☆19Updated 7 years ago
- A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.☆66Updated 3 years ago
- ☆36Updated 5 years ago
- A multi-armed bandit library for Python☆82Updated 5 years ago
- No Regrets: A deep dive comparison of bandits and A/B testing☆47Updated 7 years ago
- Multi-Armed Bandit Algorithms Library (MAB)☆133Updated 2 years ago
- Multi-Armed Bandit algorithms applied to the MovieLens 20M dataset☆56Updated 4 years ago
- Source code for our paper "Joint Policy-Value Learning for Recommendation" published at KDD 2020.☆22Updated last year
- Epsilon-greedy, softmax and LinUCB contextual bandit implementations [recommender systems]☆49Updated 6 years ago
- Bayesian Logistic Regression using Laplace approximations to the posterior.☆47Updated 8 years ago
- Hybrid Linear UCB Multi-arm Bandit library☆14Updated 8 years ago