shuaili8 / Bandit_book_solutions
☆13 Updated 3 years ago
Alternatives and similar repositories for Bandit_book_solutions
Users who are interested in Bandit_book_solutions are comparing it to the repositories listed below.
- ☆11 Updated 5 years ago
- Implementation of A Context-Integrated Transformer-Based Neural Network for Auction Design (ICML2022). ☆19 Updated 3 years ago
- Implementation of Optimal Auctions through Deep Learning ☆135 Updated 6 years ago
- ☆49 Updated 5 years ago
- A curated list on papers about combinatorial multi-armed bandit problems. ☆17 Updated 4 years ago
- A list of papers regarding generalization in (deep) reinforcement learning ☆153 Updated 2 years ago
- Study NeuralUCB and regret analysis for contextual bandit with neural decision ☆98 Updated 4 years ago
- Personal Repo to keep track of RL papers ☆31 Updated 4 years ago
- ☆135 Updated last year
- RLA is a tool for managing your RL experiments automatically ☆72 Updated 2 years ago
- ☆108 Updated 4 years ago
- ☆25 Updated 4 years ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI) ☆25 Updated 2 years ago
- ☆18 Updated 6 years ago
- Implementation of the Off Belief Learning algorithm. ☆49 Updated 3 years ago
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration” ☆26 Updated 2 years ago
- [NeurIPS 2022] "NSNet: A General Neural Probabilistic Framework for Satisfiability Problems" ☆19 Updated 2 years ago
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction" ☆11 Updated 6 years ago
- Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond. ☆49 Updated 3 years ago
- ICLR'22 Programmatic Reinforcement Learning ☆16 Updated 2 years ago
- A collection of the pytorch implementation of neural bandit algorithm includes neuralUCB(Neural Contextual Bandits with UCB-based Explora… ☆25 Updated 6 months ago
- The repository archives papers regarding the combination of combinatorial optimization and machine learning and corresponding reading not… ☆165 Updated 5 years ago
- ☆12 Updated 5 years ago
- A collection of research and survey papers of hierarchical reinforcement learning (HRL). ☆52 Updated 5 years ago
- Learning to Perform Local Rewriting for Combinatorial Optimization ☆154 Updated 6 years ago
- A beamer template for LAMDA lab at NJU ☆16 Updated 5 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization" ☆37 Updated 6 years ago
- This is the code for the paper "A Scalable Neural Network for DSIC Affine Maximizer" in NeurIPS 2023. ☆11 Updated 2 years ago
- Learning local search heuristics for Boolean satisfiability ☆37 Updated last year
- Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020. ☆20 Updated 4 years ago