shuaili8 / Bandit_book_solutions
☆13 · Updated 3 years ago
Alternatives and similar repositories for Bandit_book_solutions
Users who are interested in Bandit_book_solutions are comparing it to the repositories listed below
- ☆11 · Updated 5 years ago
- ☆43 · Updated 5 years ago
- Implementation of A Context-Integrated Transformer-Based Neural Network for Auction Design (ICML2022). ☆19 · Updated 3 years ago
- Implementation of Optimal Auctions through Deep Learning ☆133 · Updated 5 years ago
- Study NeuralUCB and regret analysis for contextual bandit with neural decision ☆98 · Updated 3 years ago
- ☆106 · Updated 4 years ago
- Learning to Perform Local Rewriting for Combinatorial Optimization ☆151 · Updated 5 years ago
- A list of papers regarding generalization in (deep) reinforcement learning ☆152 · Updated 2 years ago
- ☆47 · Updated last week
- ☆131 · Updated last year
- ☆25 · Updated 4 years ago
- PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, H… ☆71 · Updated 5 years ago
- [NeurIPS 2022] "NSNet: A General Neural Probabilistic Framework for Satisfiability Problems" ☆19 · Updated 2 years ago
- ☆11 · Updated 5 years ago
- The repository archives papers regarding the combination of combinatorial optimization and machine learning and corresponding reading not… ☆165 · Updated 4 years ago
- ☆18 · Updated 6 years ago
- A collection of research and survey papers of hierarchical reinforcement learning (HRL). ☆50 · Updated 5 years ago
- RLA is a tool for managing your RL experiments automatically ☆71 · Updated 2 years ago
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction" ☆11 · Updated 5 years ago
- Solutions for CS294-112 Fall2018 assignments in Pytorch ☆20 · Updated 7 years ago
- FEN Code ☆38 · Updated 5 years ago
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas. ☆95 · Updated 4 months ago
- Assignments for CS294-112 Fall2018 in Pytorch ☆64 · Updated 7 years ago
- Implementation of ECO-DQN as reported in "Exploratory Combinatorial Optimization with Reinforcement Learning". ☆80 · Updated 4 years ago
- Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Single-GPU/CPU compatib… ☆65 · Updated 2 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆122 · Updated 4 years ago
- [NeurIPS 2020, Spotlight] State-Adversarial DQN (SA-DQN) for robust deep reinforcement learning ☆34 · Updated 4 years ago
- Machine Learning for Combinatorial Optimization - NeurIPS'21 competition ☆137 · Updated 3 years ago
- ☆15 · Updated 2 years ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI) ☆24 · Updated last year