Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018.
☆20Apr 3, 2018Updated 8 years ago
Alternatives and similar repositories for best-arm-delayed
Users that are interested in best-arm-delayed are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆10May 8, 2018Updated 8 years ago
- Online Variance Reduction☆15May 9, 2019Updated 7 years ago
- Bias Correction of Learned Generative Models using Likelihood-Free Importance Weighting☆11Mar 24, 2023Updated 3 years ago
- Code for "Modeling Sparse Deviations for Compressed Sensing using Generative Models", ICML 2018☆24Jul 5, 2018Updated 7 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆22Oct 26, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Implementation of Decision Stacks: Flexible RL via Modular Generative Models [NeurIPS 2023]☆12Jun 27, 2023Updated 2 years ago
- ☆15Sep 7, 2022Updated 3 years ago
- ☆27May 17, 2019Updated 6 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"☆18Nov 2, 2017Updated 8 years ago
- ☆41Aug 24, 2018Updated 7 years ago
- Java implementation of Thompson sampling to solve the multi-armed bandit problem☆30Jun 14, 2023Updated 2 years ago
- Analyzes and adjusts the volume of MP3 files☆12Apr 7, 2019Updated 7 years ago
- Clockwork VAEs in JAX/Flax☆32Jul 16, 2021Updated 4 years ago
- Open source implementation of SeaRNN (ICLR 2018, https://openreview.net/forum?id=HkUR_y-RZ)☆48Jul 4, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of original Benders procedures in Python☆10Apr 6, 2019Updated 7 years ago
- pyrff: Python implementation of random fourier feature approximations for gaussian processes☆28Updated this week
- ☆12Jul 3, 2021Updated 4 years ago
- More about the exploration-exploitation tradeoff with harder bandits☆24May 12, 2019Updated 6 years ago
- Fair Generative Modeling via Weak Supervision☆21Nov 22, 2022Updated 3 years ago
- ☆15Feb 19, 2025Updated last year
- Notes for short course on econometrics in Stan☆13Jun 17, 2017Updated 8 years ago
- AndroidSlicer is a dynamic slicing tool, useful for a variety of tasks, from testing to debugging to security.☆14Jul 28, 2019Updated 6 years ago
- Heuristic Dynamic Programming with Python☆14Jul 28, 2014Updated 11 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code to study the generalisability of benchmark models on non-stationary EHRs.☆15Aug 7, 2019Updated 6 years ago
- PhysioNet 2019 Challenge: Early Prediction of Sepsis from Clinical Data☆12May 19, 2019Updated 6 years ago
- Binary floating-point formats in Go (IEEE 754 half and quadruple precision, x86 extended precision and PowerPC quadruple precision with d…☆23Dec 12, 2021Updated 4 years ago
- Hypothesis testing (Parametric/Non-Parametric)☆11Oct 8, 2019Updated 6 years ago
- Code for 'Contrastive Multi-Document Question Generation'☆11Oct 16, 2022Updated 3 years ago
- Book code for Testing in Scala on O'Reilly☆14May 29, 2014Updated 11 years ago
- Implementation of Alpha Go Zero algorithm for the game of tic-tac-toe☆16Nov 4, 2017Updated 8 years ago
- This repository contains PyTorch implemenation of WWW 2023 research paper: Optimizing Feature Set for Click-through Rate Prediction.☆12Oct 23, 2023Updated 2 years ago
- online learning for time series prediction☆13May 17, 2014Updated 11 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of my Bayesian Optimization algorithms☆12Mar 17, 2018Updated 8 years ago
- Some starter code for training/testing some basic CNN models given our data.☆10Feb 15, 2017Updated 9 years ago
- ☆17May 16, 2018Updated 7 years ago
- An example of how the LIME algorithm can be used to provide real-world insight into the decision processes of a 'black-box' machine learn…☆15Feb 19, 2019Updated 7 years ago
- A neural branch predictor tested using CPU emulator, testing both supervised learning and reinforcement learning (for COS 583: Great Mome…☆15May 17, 2017Updated 8 years ago
- Scientific-Computing-with-Scala_Code☆16Jan 30, 2023Updated 3 years ago
- Codes for Stackelberg GAN☆15Apr 23, 2019Updated 7 years ago