jkomiyama / duelingbanditlibLinks

☆11

Alternatives and similar repositories for duelingbanditlib

Users that are interested in duelingbanditlib are comparing it to the libraries listed below

Sorting:

ardaegeunlu / X-armed-Bandits
Implementation of the X-armed Bandits algorithm, as detailed in the paper, "X-armed Bandits", Bubeck et al., 2011.
☆9Updated 7 years ago
Alanthink / banditpylib
A lightweight python library for bandit algorithms
☆30Updated 2 years ago
tor / libbandit
Library for Multi-Armed Bandit Algorithms
☆58Updated 8 years ago
JournalMLR / jmlr-style-file
LaTeX style file for the Journal of Machine Learning Research
☆9Updated 5 years ago
v-i-s-h / MAB.jl
A Julia Package for providing Multi Armed Bandit Experiments
☆21Updated 6 years ago
iurteaga / bandits
Public repository for the work on bandit problems
☆23Updated last year
j-wang / BanditEmpirical
Empirical tests of various bandit algorithms.
☆16Updated 10 years ago
jkomiyama / multiplaybanditlib
☆20Updated 9 years ago
UCLA-StarAI / LearnPSDD
☆15Updated 6 years ago
zalanborsos / online-variance-reduction
Online Variance Reduction
☆14Updated 6 years ago
dquail / NonStationaryBandit
Non stationary bandit for experiments with Reinforcement Learning
☆34Updated 8 years ago
HuasenWu / DuelingBandits
Simulations for Dueling Bandit Algorithms, including our Double Thompson Sampling (D-TS) algorithms
☆25Updated 8 years ago
rjagerman / wsdm2019-nonstationary
Non-stationary Off-policy Evaluation
☆13Updated 6 years ago
abietti / cb_bakeoff
scripts for evaluation of contextual bandit algorithms
☆45Updated 5 years ago
duvenaud / herding-paper
Optimally-weighted herding is Bayesian Quadrature
☆16Updated 9 years ago
econtal / gp-optimization-python
Implementation of my Bayesian Optimization algorithms
☆12Updated 7 years ago
mim / igmm
Infinite Gaussian Mixture Model
☆31Updated 9 years ago
agentmodels / agentmodels.org
Modeling agents with probabilistic programs
☆67Updated 5 years ago
matejbalog / gumbel-relatives
Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick
☆17Updated 8 years ago
ermongroup / best-arm-delayed
Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018.
☆19Updated 7 years ago
alexrutar / banditvis
A Python 3 Bandit Visualization Package
☆11Updated 7 years ago
51alg / TerpreT
☆43Updated 7 years ago
mlresearch / mlresearch.github.io
Machine Learning Research Homepage
☆43Updated 5 months ago
CW-Huang / BayesianHypernet
☆17Updated 7 years ago
BigBayes / PosteriorServer
Posterior Server
☆15Updated 8 years ago
hasktorch / ffi-experimental
(DEPRECATED, migrated to main repo - hasktorch/hasktorch) Research code generation / FFI binding using libtorch 1.x for the next Hasktor…
☆11Updated 5 years ago
YRussac / WeightedLinearBandits
Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"
☆17Updated 5 years ago
pyro-ppl / pyro-models
Repository of models in Pyro
☆29Updated 11 months ago
przchojecki / deepalgebra
DeepAlgebra
☆25Updated 7 years ago
HIPS / maxwells-daemon
Fastidious accounting of entropy streams into and out of optimization and sampling algorithms.
☆33Updated 9 years ago