gdmarmerola / advanced-bandit-problemsLinks

More about the exploration-exploitation tradeoff with harder bandits

☆24

Alternatives and similar repositories for advanced-bandit-problems

Users that are interested in advanced-bandit-problems are comparing it to the libraries listed below

Sorting:

gdmarmerola / interactive-intro-rl
Big Data's open seminars: An Interactive Introduction to Reinforcement Learning
☆63Updated 4 years ago
akhadangi / Multi-armed-Bandits
In this notebook several classes of multi-armed bandits are implemented. This includes epsilon greedy, UCB, Linear UCB (Contextual bandit…
☆89Updated 5 years ago
sauxpa / neural_exploration
Study NeuralUCB and regret analysis for contextual bandit with neural decision
☆99Updated 4 years ago
alison-carrera / mabalgs
Multi-Armed Bandit Algorithms Library (MAB)
☆133Updated 3 years ago
kfoofw / bandit_simulations
Bandit algorithms simulations for online learning
☆89Updated 5 years ago
andrecianflone / thompson
Thompson Sampling Tutorial
☆55Updated 6 years ago
usaito / counterfactual-cv
(ICML2020) “Counterfactual Cross-Validation: Stable Model Selection Procedure for Causal Inference Models’’
☆31Updated 2 years ago
lilianweng / multi-armed-bandit
Play with the solutions to the multi-armed-bandit problem.
☆415Updated last year
HCDM / BanditLib
Library of contextual bandits algorithms
☆336Updated last year
jldbc / bandits
Multi-Armed Bandit algorithms applied to the MovieLens 20M dataset
☆57Updated 5 years ago
criteo-research / optimization-continuous-action-crm
☆30Updated 5 years ago
SMPyBandits / SMPyBandits
🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorith…
☆416Updated last year
SoluMilken / Contextual-Bandit
Contextual Bandit Algorithms (+Bandit Algorithms)
☆22Updated 6 years ago
GitHubLuCheng / LTEE
Implementation of paper Long-Term Effect Estimation with Surrogate Representation
☆13Updated 5 years ago
adith387 / slates_semisynth_expts
Semi-synthetic experiments to test several approaches for off-policy evaluation and optimization of slate recommenders.
☆43Updated 8 years ago
Networks-Learning / hdhp.py
Code for "Hierarchical Dirichlet-Hawkes process: generative model and inference algorithm", WWW 2017
☆36Updated 7 years ago
jhartford / DeepIV
Implementation of Deep IV: A Flexible Approach for Counterfactual Prediction
☆161Updated 4 years ago
ZIYU-DEEP / Awesome-Papers-on-Combinatorial-Semi-Bandit-Problems
A curated list on papers about combinatorial multi-armed bandit problems.
☆17Updated 4 years ago
dquail / NonStationaryBandit
Non stationary bandit for experiments with Reinforcement Learning
☆33Updated 8 years ago
IBM-HRL-MLHLS / IBM-Causal-Inference-Benchmarking-Framework
Data derived from the Linked Births and Deaths Data (LBIDD); simulated pairs of treatment assignment and outcomes; scoring code
☆84Updated 7 years ago
gpleiss / equalized_odds_and_calibration
Code and data for the experiments in "On Fairness and Calibration"
☆51Updated 3 years ago
debmandal / RL-Causality
References at the Intersection of Causality and Reinforcement Learning
☆90Updated 5 years ago
antonismand / Personalized-News-Recommendation
Multi Armed Bandits implementation using the Yahoo! Front Page Today Module User Click Log Dataset
☆99Updated 4 years ago
ijmbarr / notes-on-causal-inference
Some notes on Causal Inference, with examples in python
☆154Updated 5 years ago
ntucllab / striatum
Contextual bandit in python
☆112Updated 4 years ago
charmlab / mace
Model Agnostic Counterfactual Explanations
☆88Updated 3 years ago
rjagerman / wsdm2019-nonstationary
Non-stationary Off-policy Evaluation
☆13Updated 7 years ago
olivierjeunen / decision-theory-www-2021
Materials for the "Recommender Systems through the lens of Decision Theory" tutorial delivered at the 30th Web Conference (WWW '21).
☆11Updated 4 years ago
usaito / dr-ranking-metric
(RecSys2020) "Doubly Robust Estimator for Ranking Metrics with Post-Click Conversions"
☆24Updated 2 years ago
CausalML / continuous-policy-learning
☆12Updated 6 years ago