shuaili8 / Bandit_book_solutions
☆13 Updated 2 years ago
Alternatives and similar repositories for Bandit_book_solutions:
Users who are interested in Bandit_book_solutions are comparing it to the repositories listed below.
- ☆11 Updated 4 years ago
- A curated list of papers about combinatorial multi-armed bandit problems. ☆17 Updated 3 years ago
- Paper list in the area of reinforcement learning for recommendation systems ☆24 Updated 4 years ago
- PyTorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction" ☆11 Updated 5 years ago
- [NeurIPS 2022] "NSNet: A General Neural Probabilistic Framework for Satisfiability Problems" ☆18 Updated last year
- ☆14 Updated 4 years ago
- ☆30 Updated 4 years ago
- Code for "A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising" WSDM 2022 ☆20 Updated 3 years ago
- ☆24 Updated 2 years ago
- Solutions for CS294-112 Fall 2018 assignments in PyTorch ☆19 Updated 6 years ago
- ☆18 Updated 5 years ago
- ☆26 Updated 5 years ago
- PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, H… ☆70 Updated 5 years ago
- Theory of Reinforcement Learning ☆16 Updated 3 years ago
- Representation Learning in RL ☆16 Updated 2 years ago
- ☆97 Updated 4 years ago
- A new paper list for multi-agent reinforcement learning (actively maintained) ☆25 Updated 4 years ago
- Implementation of A Context-Integrated Transformer-Based Neural Network for Auction Design (ICML 2022). ☆16 Updated 2 years ago
- Study NeuralUCB and regret analysis for contextual bandit with neural decision ☆92 Updated 3 years ago
- ☆85 Updated 7 months ago
- Implementation of the Off Belief Learning algorithm. ☆46 Updated 2 years ago
- Python implementation of the TPGR ☆39 Updated 5 years ago
- A list of papers regarding generalization in (deep) reinforcement learning ☆151 Updated last year
- Source code for the paper "Divergence-Augmented Policy Optimization" ☆37 Updated 5 years ago
- ☆60 Updated 6 years ago
- RLA is a tool for managing your RL experiments automatically ☆71 Updated 2 years ago
- Implementation of Hierarchical Deep Q-Learning (Kulkarni et al., 2016) ☆35 Updated 5 years ago
- Implementation of Optimal Auctions through Deep Learning ☆123 Updated 5 years ago
- ☆19 Updated 2 years ago
- Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising ☆26 Updated 4 years ago