ZIYU-DEEP / Awesome-Papers-on-Combinatorial-Semi-Bandit-Problems
A curated list of papers on combinatorial multi-armed bandit problems.
☆17 Updated 3 years ago
Alternatives and similar repositories for Awesome-Papers-on-Combinatorial-Semi-Bandit-Problems:
Users who are interested in Awesome-Papers-on-Combinatorial-Semi-Bandit-Problems are comparing it to the repositories listed below:
- ☆13 Updated 2 years ago
- ☆30 Updated 4 years ago
- Dynamic Pricing BwK Problem and Reinforcement Learning ☆29 Updated 6 years ago
- In this notebook several classes of multi-armed bandits are implemented. This includes epsilon greedy, UCB, Linear UCB (Contextual bandit… ☆84 Updated 4 years ago
- Offline evaluation of multi-armed bandit algorithms ☆22 Updated 4 years ago
- Study NeuralUCB and regret analysis for contextual bandit with neural decision ☆92 Updated 3 years ago
- Bandit algorithms simulations for online learning ☆83 Updated 4 years ago
- Multi-Armed Bandit Algorithms Library (MAB) ☆132 Updated 2 years ago
- ☆31 Updated 3 years ago
- ☆14 Updated 4 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments" ☆17 Updated 5 years ago
- Code for the experiments of Matrix Factorization Bandit ☆24 Updated 6 years ago
- Bayesian Optimization Meets Bayesian Optimal Stopping ☆31 Updated 4 years ago
- ☆27 Updated 5 months ago
- Code and data for decision making under strategic behavior, NeurIPS 2020 & Management Science 2024. ☆28 Updated last year
- ☆11 Updated 4 years ago
- An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR… ☆13 Updated 3 years ago
- Implementation of the Adaptive Contextual Combinatorial Upper Confidence Bound (ACC-UCB) algorithm for the contextual combinatorial volat… ☆18 Updated 5 years ago
- Contextual Bandit Algorithms (+Bandit Algorithms) ☆22 Updated 5 years ago
- Source code for our paper "Pessimistic Decision-Making for Recommender Systems" published at ACM TORS, and RecSys 2021. ☆10 Updated 2 years ago
- ☆15 Updated last year
- Implementing LinUCB and HybridLinUCB in Python. ☆48 Updated 6 years ago
- More about the exploration-exploitation tradeoff with harder bandits ☆23 Updated 5 years ago
- Code for "A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising" WSDM 2022 ☆20 Updated 3 years ago
- Code for the WSDM '20 paper, Learning Individual Causal Effects from Networked Observational Data. ☆74 Updated 3 years ago
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances ☆12 Updated 2 years ago
- ☆30 Updated 4 years ago
- A Python implementation of Dueling Bandit Gradient Descent (DBGD) ☆23 Updated 6 years ago
- Code for "Counterfactual Explanations in Sequential Decision Making Under Uncertainty", NeurIPS 2021 ☆16 Updated 2 years ago
- Implementation of various multi-armed bandit algorithms on a 10-arm testbed. ☆38 Updated 5 years ago