shuaili8 / Bandit_book_solutions
☆13 Updated 2 years ago
Related projects
Alternatives and complementary repositories for Bandit_book_solutions
- ☆11 Updated 4 years ago
- A curated list of papers on combinatorial multi-armed bandit problems. ☆17 Updated 3 years ago
- Paper list in the area of reinforcement learning for recommendation systems ☆24 Updated 4 years ago
- ☆14 Updated 4 years ago
- [NeurIPS 2022] "NSNet: A General Neural Probabilistic Framework for Satisfiability Problems" ☆19 Updated last year
- PyTorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction" ☆10 Updated 5 years ago
- ☆97 Updated 3 years ago
- Python implementation of TPGR ☆39 Updated 5 years ago
- Code for "A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising" WSDM 2022 ☆19 Updated 2 years ago
- A TensorFlow implementation of SOFA, the Simulator for OFfline LeArning and evaluation. ☆20 Updated 3 years ago
- ☆18 Updated 5 years ago
- FEN Code ☆37 Updated 5 years ago
- Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising ☆27 Updated 4 years ago
- ☆12 Updated 11 months ago
- Implementation of the Adaptive Contextual Combinatorial Upper Confidence Bound (ACC-UCB) algorithm for the contextual combinatorial volat… ☆16 Updated 4 years ago
- Baseline for NeurIPS_Auto_Bidding_General_Track ☆24 Updated 3 months ago
- Implementation of Optimal Auctions through Deep Learning ☆115 Updated 5 years ago
- PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, H… ☆68 Updated 4 years ago
- ☆15 Updated 3 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments" ☆17 Updated 5 years ago
- Implementation for our paper in NeurIPS 2019 ☆47 Updated 4 years ago
- ☆118 Updated 4 months ago
- Kuaishou Online RL Benchmark ☆18 Updated last year
- Implementation of A Context-Integrated Transformer-Based Neural Network for Auction Design (ICML2022). ☆14 Updated 2 years ago
- ☆12 Updated last year
- ☆33 Updated 2 months ago
- ☆17 Updated 2 years ago
- Solutions for CS294-112 Fall2018 assignments in PyTorch ☆19 Updated 6 years ago
- Official implementation of Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning at ICML… ☆37 Updated 3 years ago
- WIP implementation of https://arxiv.org/pdf/1901.08162.pdf ☆9 Updated 4 years ago