ermongroup / best-arm-delayedView external linksLinks
Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018.
☆19Apr 3, 2018Updated 7 years ago
Alternatives and similar repositories for best-arm-delayed
Users that are interested in best-arm-delayed are comparing it to the libraries listed below
Sorting:
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆10May 8, 2018Updated 7 years ago
- ☆27May 17, 2019Updated 6 years ago
- Online Variance Reduction☆15May 9, 2019Updated 6 years ago
- scripts for evaluation of contextual bandit algorithms☆45Apr 27, 2020Updated 5 years ago
- ☆42Aug 24, 2018Updated 7 years ago
- Clockwork VAEs in JAX/Flax☆32Jul 16, 2021Updated 4 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆22Oct 26, 2018Updated 7 years ago
- Implementation of Russo and Van Roy work on Information Directed Sampling (2017)☆21Jan 18, 2019Updated 7 years ago
- Code for "Modeling Sparse Deviations for Compressed Sensing using Generative Models", ICML 2018☆24Jul 5, 2018Updated 7 years ago
- A lightweight python library for bandit algorithms☆30Jul 21, 2022Updated 3 years ago
- A mini library for Policy Gradients with Parameter-based Exploration, with reference implementation of the ClipUp optimizer (https://arxi…☆73Dec 10, 2020Updated 5 years ago
- Java implementation of Thompson sampling to solve the multi-armed bandit problem☆30Jun 14, 2023Updated 2 years ago
- pyrff: Python implementation of random fourier feature approximations for gaussian processes☆27Jul 19, 2025Updated 6 months ago
- 🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorith…☆418Apr 30, 2024Updated last year
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.☆37Oct 14, 2020Updated 5 years ago
- The SOLAR blackbox optimization problem☆16Sep 24, 2025Updated 4 months ago
- Hypothesis testing (Parametric/Non-Parametric)☆11Oct 8, 2019Updated 6 years ago
- The Luhn algorithm is a simple checksum formula used to validate a variety of identification numbers, such as credit card numbers, IMEI n…☆10Dec 4, 2017Updated 8 years ago
- Training for the Olympiad in Informatics, Liceo Scientifico Galilei, Trento, 2018/2019 and 2019/2020☆10Oct 14, 2025Updated 4 months ago
- Aquarium: A Comprehensive Framework for Exploring Predator-Prey Dynamics through Multi-Agent Reinforcement Learning Algorithms☆13Apr 3, 2024Updated last year
- Code accompanying the NeurIPS 2019 paper AutoAssist: A Framework to Accelerate Training of Deep Neural Networks.☆14Oct 3, 2022Updated 3 years ago
- A3C style Option-Critic with deliberation cost☆40Jan 9, 2018Updated 8 years ago
- Scalable Bayes via Barycenter in Wasserstein Space☆10Sep 7, 2017Updated 8 years ago
- A java library to compute the difference between XML files☆14Oct 23, 2009Updated 16 years ago
- ☆13Nov 18, 2023Updated 2 years ago
- Code accompanying the paper "Semi-Unsupervised Learning with Deep Generative Models: Clustering and Classifying using Ultra-Sparse Labels…☆13Jan 25, 2019Updated 7 years ago
- Code for the paper "Learning Step-Size Adaptation in CMA-ES"☆12Mar 24, 2023Updated 2 years ago
- ☆10Jan 26, 2016Updated 10 years ago
- SNAP repository for Ringo☆14Jul 25, 2017Updated 8 years ago
- ☆11Dec 19, 2023Updated 2 years ago
- Predicting Unplanned Hospital Readmission Using Natural Language Processing of MIMICIII Discharge Notes☆12Feb 12, 2019Updated 7 years ago
- ☆12Jul 3, 2021Updated 4 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- Bias Correction of Learned Generative Models using Likelihood-Free Importance Weighting☆12Mar 24, 2023Updated 2 years ago
- Adaptation of Simple Approach to Ordinal Classification for sklearn framework☆12May 18, 2022Updated 3 years ago
- Stochastic Variance Reduction Policy Gradient Estimation☆11Nov 6, 2018Updated 7 years ago
- This repository contains PyTorch implemenation of WWW 2023 research paper: Optimizing Feature Set for Click-through Rate Prediction.☆12Oct 23, 2023Updated 2 years ago
- Alias-free Bessel function synthesis☆12Dec 12, 2014Updated 11 years ago
- Query and change XKB layout state☆11Oct 20, 2019Updated 6 years ago