ermongroup/best-arm-delayed

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ermongroup/best-arm-delayed)

ermongroup / best-arm-delayed

Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018.

☆20

Alternatives and similar repositories for best-arm-delayed

Users that are interested in best-arm-delayed are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

annieyan / Bandits-using-UCB-algorithm
View on GitHub
Thompson Sampling for Bandits using UCB policy
☆10Jul 29, 2017Updated 9 years ago
facebookresearch / reward-estimator-iclr
View on GitHub
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
☆11May 8, 2018Updated 8 years ago
zalanborsos / online-variance-reduction
View on GitHub
Online Variance Reduction
☆15May 9, 2019Updated 7 years ago
ishank-juneja / Correlated-AoI-Bandits
View on GitHub
Author's implementation of the paper Correlated Age-of-Information Bandits.
☆13Jun 19, 2021Updated 5 years ago
abbyvansoest / maxent
View on GitHub
☆14May 30, 2019Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
RonanFR / UCRL
View on GitHub
☆27May 17, 2019Updated 7 years ago
tung-nd / cwbc
View on GitHub
☆11Oct 3, 2022Updated 3 years ago
facebookresearch / reward-estimator-corl
View on GitHub
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
☆23Oct 26, 2018Updated 7 years ago
yuqingd / cusp
View on GitHub
☆15Sep 7, 2022Updated 3 years ago
christophergandrud / LSE-Beamer-Theme
View on GitHub
An Unofficial LSE LaTeX Beamer Theme
☆15Sep 1, 2015Updated 10 years ago
Shallow-Updates-for-Deep-RL / Shallow_Updates_for_Deep_RL
View on GitHub
Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"
☆18Nov 2, 2017Updated 8 years ago
vruvora / reinforcement-learning-kdd
View on GitHub
☆42Aug 24, 2018Updated 7 years ago
tianbingsz / SVRG
View on GitHub
Stochastic Variance Reduction Policy Gradient Estimation
☆11Nov 6, 2018Updated 7 years ago
abietti / cb_bakeoff
View on GitHub
scripts for evaluation of contextual bandit algorithms
☆46Apr 27, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
SMPyBandits / SMPyBandits
View on GitHub
🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorith…
☆424Jun 19, 2026Updated last month
Sound-Linux-More / mp3gain
View on GitHub
Analyzes and adjusts the volume of MP3 files
☆12Apr 7, 2019Updated 7 years ago
michaelosthege / pyrff
View on GitHub
pyrff: Python implementation of random fourier feature approximations for gaussian processes
☆29May 4, 2026Updated 2 months ago
FredrikSvenssonUK / tox21_conformal
View on GitHub
☆12Jul 3, 2021Updated 5 years ago
igsor / HDPy
View on GitHub
Heuristic Dynamic Programming with Python
☆14Jul 28, 2014Updated 12 years ago
ermongroup / fairgen
View on GitHub
Fair Generative Modeling via Weak Supervision
☆21Nov 22, 2022Updated 3 years ago
ucr-riple / AndroidSlicer
View on GitHub
AndroidSlicer is a dynamic slicing tool, useful for a variety of tasks, from testing to debugging to security.
☆14Jul 28, 2019Updated 7 years ago
MLforHealth / MIMIC_Generalisation
View on GitHub
Code to study the generalisability of benchmark models on non-stationary EHRs.
☆15Aug 7, 2019Updated 6 years ago
Alanthink / banditpylib
View on GitHub
A lightweight python library for bandit algorithms
☆30Jul 21, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
khakieconomics / StanEcon
View on GitHub
Discussion for Stan for economists
☆10Mar 29, 2016Updated 10 years ago
zhangjiong724 / autoassist-exp
View on GitHub
Code accompanying the NeurIPS 2019 paper AutoAssist: A Framework to Accelerate Training of Deep Neural Networks.
☆14Oct 3, 2022Updated 3 years ago
EarlGlynn / PhysioNet-Sepsis-Challenge
View on GitHub
PhysioNet 2019 Challenge: Early Prediction of Sepsis from Clinical Data
☆12May 19, 2019Updated 7 years ago
mewmew / float
View on GitHub
Binary floating-point formats in Go (IEEE 754 half and quadruple precision, x86 extended precision and PowerPC quadruple precision with d…
☆23Dec 12, 2021Updated 4 years ago
danielmisrael / apd
View on GitHub
Official repository for Adaptive Parallel Decoding (APD).
☆20Oct 27, 2025Updated 9 months ago
mainkoon81 / Study-02-AB-Testing
View on GitHub
Hypothesis testing (Parametric/Non-Parametric)
☆11Oct 8, 2019Updated 6 years ago
econtal / gp-optimization-python
View on GitHub
Implementation of my Bayesian Optimization algorithms
☆12Mar 17, 2018Updated 8 years ago
fuyuanlyu / OptFS
View on GitHub
This repository contains PyTorch implemenation of WWW 2023 research paper: Optimizing Feature Set for Click-through Rate Prediction.
☆12Oct 23, 2023Updated 2 years ago
yidarvin / DREAM_DM_starter_code
View on GitHub
Some starter code for training/testing some basic CNN models given our data.
☆10Feb 15, 2017Updated 9 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
alanalvestech / ubigraph_server
View on GitHub
UbiGraph Server is a system for visualizing dynamic graphs
☆12May 10, 2018Updated 8 years ago
hongyanz / Stackelberg-GAN
View on GitHub
Codes for Stackelberg GAN
☆15Apr 23, 2019Updated 7 years ago
yashpatel5400 / neuropath
View on GitHub
A neural branch predictor tested using CPU emulator, testing both supervised learning and reinforcement learning (for COS 583: Great Mome…
☆15May 17, 2017Updated 9 years ago
Sue-Hi / NLP-MIMIC-III
View on GitHub
Predicting Unplanned Hospital Readmission Using Natural Language Processing of MIMICIII Discharge Notes
☆12Feb 12, 2019Updated 7 years ago
andyliu42 / Counterfactual_Regret_Minimization_Python
View on GitHub
Counterfactual Regret Minimization (CFR) sample code in Python
☆14Apr 16, 2019Updated 7 years ago
RobRomijnders / bandit
View on GitHub
Implementation of Counterfactual risk minimization
☆26Apr 13, 2017Updated 9 years ago
nnaisense / pgpelib
View on GitHub
A mini library for Policy Gradients with Parameter-based Exploration, with reference implementation of the ClipUp optimizer (https://arxi…
☆73Dec 10, 2020Updated 5 years ago