shuaili8 / Bandit_book_solutionsLinks
☆13Updated 3 years ago
Alternatives and similar repositories for Bandit_book_solutions
Users that are interested in Bandit_book_solutions are comparing it to the libraries listed below
Sorting:
- ☆11Updated 5 years ago
- Implementation of A Context-Integrated Transformer-Based Neural Network for Auction Design (ICML2022).☆19Updated 3 years ago
- Implementation of Optimal Auctions through Deep Learning☆133Updated 5 years ago
- ☆44Updated 5 years ago
- A curated list on papers about combinatorial multi-armed bandit problems.☆17Updated 4 years ago
- Study NeuralUCB and regret analysis for contextual bandit with neural decision☆99Updated 3 years ago
- ☆106Updated 4 years ago
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Updated 6 years ago
- ☆131Updated last year
- ☆25Updated 4 years ago
- Example code for paper "Bilevel Optimization: Nonasymptotic Analysis and Faster Algorithms"☆50Updated 3 years ago
- Personal Repo to keep track of RL papers☆31Updated 4 years ago
- ☆15Updated 2 years ago
- Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond.☆47Updated 2 years ago
- [NeurIPS 2022] "NSNet: A General Neural Probabilistic Framework for Satisfiability Problems"☆19Updated 2 years ago
- A list of papers regarding generalization in (deep) reinforcement learning☆153Updated 2 years ago
- Learning local search heuristics for Boolean satisfiability☆37Updated last year
- ☆12Updated 5 years ago
- PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, H…☆71Updated 5 years ago
- ☆47Updated last week
- A beamer template for LAMDA lab at NJU☆17Updated 5 years ago
- ☆102Updated 5 years ago
- Play with the solutions to the multi-armed-bandit problem.☆414Updated last year
- A collection of the pytorch implementation of neural bandit algorithm includes neuralUCB(Neural Contextual Bandits with UCB-based Explora…☆25Updated 4 months ago
- ☆71Updated 5 years ago
- Deconfounding Reinforcement Learning in Observational Settings☆51Updated 6 years ago
- RLA is a tool for managing your RL experiments automatically☆71Updated 2 years ago
- [NeurIPS 2020, Spotlight] State-Adversarial DQN (SA-DQN) for robust deep reinforcement learning☆34Updated 4 years ago
- [NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"☆138Updated 3 years ago
- ☆14Updated 5 years ago