j-wang / BanditEmpiricalLinks
Empirical tests of various bandit algorithms.
☆16Updated 10 years ago
Alternatives and similar repositories for BanditEmpirical
Users that are interested in BanditEmpirical are comparing it to the libraries listed below
Sorting:
- Non-stationary Off-policy Evaluation☆13Updated 6 years ago
- Epsilon-greedy, softmax and LinUCB contextual bandit implementations [recommender systems]☆49Updated 6 years ago
- Semi-synthetic experiments to test several approaches for off-policy evaluation and optimization of slate recommenders.☆43Updated 7 years ago
- An experiment with Thompson sampling and TD(0) on a grid world variant☆17Updated 11 years ago
- Library for Multi-Armed Bandit Algorithms