ZIYU-DEEP / Awesome-Papers-on-Combinatorial-Semi-Bandit-Problems
A curated list on papers about combinatorial multi-armed bandit problems.
☆17Updated 3 years ago
Alternatives and similar repositories for Awesome-Papers-on-Combinatorial-Semi-Bandit-Problems:
Users that are interested in Awesome-Papers-on-Combinatorial-Semi-Bandit-Problems are comparing it to the libraries listed below
- ☆13Updated 2 years ago
- Dynamic Pricing BwK Problem and Reinforcement Learning☆29Updated 6 years ago
- In this notebook several classes of multi-armed bandits are implemented. This includes epsilon greedy, UCB, Linear UCB (Contextual bandit…☆82Updated 4 years ago
- Study NeuralUCB and regret analysis for contextual bandit with neural decision☆91Updated 3 years ago
- Bandit algorithms simulations for online learning☆83Updated 4 years ago
- Code for "A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising" WSDM 2022☆20Updated 3 years ago
- ☆29Updated 4 years ago
- Code for the WSDM '20 paper, Learning Individual Causal Effects from Networked Observational Data.☆74Updated 3 years ago
- Package for building Market Segmentation Trees, Choice Model Trees, and Isotonic Regression Trees☆16Updated last year
- ☆17Updated 3 years ago
- Offline evaluation of multi-armed bandit algorithms☆22Updated 4 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 5 years ago
- ☆19Updated last year
- Implementation of the Adaptive Contextual Combinatorial Upper Confidence Bound (ACC-UCB) algorithm for the contextual combinatorial volat…☆18Updated 4 years ago
- Reinforcement Learning for Uplift Modeling☆12Updated 3 years ago
- This repository contains codes for paper: Generalized Linear Bandits with Local Differential Privacy by Yuxuan Han, Zhipeng Liang, Yang W…☆16Updated 3 years ago
- An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR…☆13Updated 2 years ago
- [ICLR 2021] Code for: Varying Coefficient Neural Network with Functional Targeted Regularization for Estimating Continuous Treatment Effe…☆72Updated 2 years ago
- More about the exploration-exploitation tradeoff with harder bandits☆24Updated 5 years ago
- Integration of DNN framework with Stochastic Multi-echelon Inventory Optimization (SMEIO)☆8Updated 3 years ago
- ☆45Updated 2 years ago
- ☆11Updated 4 years ago
- A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.☆66Updated 3 years ago
- ☆16Updated last year
- Implementation of A Context-Integrated Transformer-Based Neural Network for Auction Design (ICML2022).☆15Updated 2 years ago
- ☆17Updated 3 years ago
- Code for the experiments of Matrix Factorization Bandit☆24Updated 6 years ago
- ☆14Updated 4 years ago
- A Python Package for Non-stationary Online Learning (PyNOL)☆29Updated 10 months ago
- Datasets for Causal-Structure-Learning Repo☆15Updated 4 years ago