shuaili8 / Bandit_book_solutionsLinks
☆13Updated 3 years ago
Alternatives and similar repositories for Bandit_book_solutions
Users that are interested in Bandit_book_solutions are comparing it to the libraries listed below
Sorting:
- The official code for ROLeR from CIKM 2024☆7Updated 7 months ago
- A curated list on papers about combinatorial multi-armed bandit problems.☆17Updated 4 years ago
- ☆11Updated 4 years ago
- paper list in the area of reinforcenment learning for recommendation systems☆24Updated 4 years ago
- [NeurIPS 2022] "NSNet: A General Neural Probabilistic Framework for Satisfiability Problems"☆18Updated 2 years ago
- mcc_demo☆10Updated 3 years ago
- ☆35Updated 5 years ago
- ☆17Updated 3 years ago
- Study NeuralUCB and regret analysis for contextual bandit with neural decision☆95Updated 3 years ago
- ☆14Updated 5 years ago
- WIP implementation of https://arxiv.org/pdf/1901.08162.pdf☆9Updated 5 years ago
- python implementation of the TPGR☆39Updated 6 years ago
- Distributed DRL by Ray and TensorFlow Tutorial.☆10Updated 5 years ago
- A TensorFlow implementation of SOFA, the Simulator for OFfline LeArning and evaluation.☆21Updated 4 years ago
- ☆25Updated 3 years ago
- ☆27Updated 5 years ago
- Code used in our paper "Robust Deep Reinforment Learning through Adversarial Loss"☆33Updated last year
- Experiments codes for SIGIR '20 paper "A General Knowledge Distillation Framework for Counterfactual Recommendation via Uniform Data"☆32Updated 5 years ago
- A toolkit of Reinforcement Learning based Recommendation (RL4Rec)☆23Updated 3 years ago
- ☆106Updated 4 years ago
- ☆13Updated 3 years ago
- (ICML2022) Off-Policy Evaluation for Large Action Spaces via Embeddings☆20Updated 2 years ago
- Meta-Reinforcement Learning with Policy Residual Representation☆11Updated 5 years ago
- Implementation of Optimal Auctions through Deep Learning☆128Updated 5 years ago
- Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.☆20Updated 4 years ago
- Offline evaluation of multi-armed bandit algorithms☆23Updated 4 years ago
- Code for "A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising" WSDM 2022☆23Updated 3 years ago
- ☆12Updated last month
- ☆18Updated 4 years ago
- Solutions for CS294-112 Fall2018 assignments in Pytorch☆20Updated 6 years ago