shuaili8 / Bandit_book_solutions
☆13 Updated 2 years ago
Alternatives and similar repositories for Bandit_book_solutions:
Users who are interested in Bandit_book_solutions are comparing it to the repositories listed below
- ☆11 Updated 4 years ago
- A curated list of papers about combinatorial multi-armed bandit problems. ☆17 Updated 3 years ago
- Paper list in the area of reinforcement learning for recommendation systems ☆24 Updated 4 years ago
- ☆29 Updated 4 years ago
- Implementation of A Context-Integrated Transformer-Based Neural Network for Auction Design (ICML 2022). ☆15 Updated 2 years ago
- ☆14 Updated 4 years ago
- Python implementation of TPGR ☆39 Updated 5 years ago
- PyTorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction" ☆10 Updated 5 years ago
- A TensorFlow implementation of SOFA, the Simulator for OFfline LeArning and evaluation. ☆20 Updated 4 years ago
- ☆18 Updated 5 years ago
- Solutions for CS294-112 Fall 2018 assignments in PyTorch ☆19 Updated 6 years ago
- Code for "A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising" WSDM 2022 ☆19 Updated 3 years ago
- Study NeuralUCB and regret analysis for contextual bandit with neural decision ☆91 Updated 3 years ago
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances ☆12 Updated 2 years ago
- Theory of Reinforcement Learning ☆16 Updated 3 years ago
- ☆97 Updated 3 years ago
- ☆26 Updated 5 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments" ☆17 Updated 5 years ago
- WIP implementation of https://arxiv.org/pdf/1901.08162.pdf ☆9 Updated 4 years ago
- [ICLR 2021] Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments. ☆57 Updated last year
- ☆85 Updated 5 months ago
- Deconfounding Reinforcement Learning in Observational Settings ☆48 Updated 5 years ago
- FEN Code ☆37 Updated 5 years ago
- Assignments for CS294-112 Fall 2018 in PyTorch ☆63 Updated 6 years ago
- Must-read papers on Reinforcement Learning (RL) ☆41 Updated 4 years ago
- Official implementation of Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning at ICML… ☆37 Updated 3 years ago
- A Multi-agent Learning Framework ☆62 Updated 3 years ago
- Contextual Bandits Action Elimination DQN ☆19 Updated 6 years ago
- ☆25 Updated 3 years ago
- My research paper notes, focusing on data mining, recommender systems, and reinforcement learning. ☆19 Updated 3 years ago