gdmarmerola / interactive-intro-rlLinks

Big Data's open seminars: An Interactive Introduction to Reinforcement Learning

☆64

Alternatives and similar repositories for interactive-intro-rl

Users that are interested in interactive-intro-rl are comparing it to the libraries listed below

Sorting:

kfoofw / bandit_simulations
Bandit algorithms simulations for online learning
☆88Updated 5 years ago
cemoody / simple_mf
Simple but Flexible Recommendation Engine in PyTorch
☆133Updated 3 years ago
jldbc / bandits
Multi-Armed Bandit algorithms applied to the MovieLens 20M dataset
☆56Updated 4 years ago
Ibotta / mr_uplift
Multiple Response Uplift (or heterogeneous treatment effects) package that builds and evaluates tradeoffs with multiple treatments and mu…
☆69Updated 3 months ago
banditml / banditml
A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.
☆66Updated 4 years ago
ntucllab / striatum
Contextual bandit in python
☆114Updated 4 years ago
alison-carrera / mabalgs
Multi-Armed Bandit Algorithms Library (MAB)
☆133Updated 2 years ago
usaito / recsys2021-tutorial
https://sites.google.com/cornell.edu/recsys2021tutorial
☆55Updated 3 years ago
gdmarmerola / advanced-bandit-problems
More about the exploration-exploitation tradeoff with harder bandits
☆24Updated 6 years ago
criteo-research / bandit-reco
☆50Updated 4 years ago
rk2900 / deep-conv-attr
An implementation of our CIKM 2018 paper "Deep Conversion Attribution with Dual-attention Recurrent Neural Network"
☆62Updated 6 years ago
BartyzalRadek / contextual-bandits-recommender
Implementing LinUCB and HybridLinUCB in Python.
☆49Updated 7 years ago
antonismand / Personalized-News-Recommendation
Multi Armed Bandits implementation using the Yahoo! Front Page Today Module User Click Log Dataset
☆100Updated 3 years ago
usaito / counterfactual-cv
(ICML2020) “Counterfactual Cross-Validation: Stable Model Selection Procedure for Causal Inference Models’’
☆31Updated 2 years ago
HCDM / BanditLib
Library of contextual bandits algorithms
☆334Updated last year
daddydrac / Contextual-Multi-Armed-Bandits
☆36Updated 6 years ago
spotify-research / RIPS_KDD2020
☆18Updated 4 years ago
amazon-science / auction-gym
AuctionGym is a simulation environment that enables reproducible evaluation of bandit and reinforcement learning methods for online adver…
☆174Updated last month
deezer / carousel_bandits
Source code and data from the RecSys 2020 article "Carousel Personalization in Music Streaming Apps with Contextual Bandits" by W. Bendad…
☆57Updated 4 years ago
Kenza-AI / mab-ranking
Online Ranking with Multi-Armed-Bandits
☆18Updated 3 years ago
adith387 / slates_semisynth_expts
Semi-synthetic experiments to test several approaches for off-policy evaluation and optimization of slate recommenders.
☆43Updated 7 years ago
allenday / contextual-bandit
working example of a contextual multi-armed bandit
☆54Updated 5 years ago
jrzaurin / RecoTour
A tour through recommendation algorithms in python [IN PROGRESS]
☆177Updated 7 months ago
umeshksingla / news-recommend-ire
Stream Data based News Recommendation - Contextual Bandit Approach
☆48Updated 7 years ago
akhadangi / Multi-armed-Bandits
In this notebook several classes of multi-armed bandits are implemented. This includes epsilon greedy, UCB, Linear UCB (Contextual bandit…
☆87Updated 4 years ago
causal-machine-learning / kdd2021-tutorial
EconML/CausalML KDD 2021 Tutorial
☆161Updated last year
fidelity / mabwiser
[IJAIT 2021] MABWiser: Contextual Multi-Armed Bandits Library
☆251Updated 11 months ago
google-research / recsim_ng
RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems
☆120Updated 3 years ago
Netflix-Skunkworks / rl_for_budget_constrained_recs
☆44Updated 2 years ago
brian-c-ogorman / ABanditTesting
No Regrets: A deep dive comparison of bandits and A/B testing
☆47Updated 7 years ago