uclaml / NeuralUCB
☆28Updated 4 years ago
Related projects
Alternatives and complementary repositories for NeuralUCB
- Study NeuralUCB and regret analysis for contextual bandit with neural decision☆89Updated 2 years ago
- In this notebook several classes of multi-armed bandits are implemented. This includes epsilon greedy, UCB, Linear UCB (Contextual bandit…☆77Updated 3 years ago
- An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR…☆13Updated 2 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 4 years ago
- Paper list in the area of reinforcement learning for recommendation systems☆24Updated 4 years ago
- Deconfounding Reinforcement Learning in Observational Settings☆48Updated 5 years ago
- Python implementation of TPGR☆39Updated 5 years ago
- A PyTorch implementation of "A Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation".☆39Updated 4 years ago
- MovieLens recommendation system using reinforcement learning (GYM + PPO)☆46Updated 4 years ago
- ☆14Updated 4 years ago
- Exact Pareto Optimal solutions for preference based Multi-Objective Optimization☆55Updated 2 years ago
- Implementation of A Context-Integrated Transformer-Based Neural Network for Auction Design (ICML2022).☆14Updated 2 years ago
- Implementation for our paper in NeurIPS 2019☆47Updated 4 years ago
- A curated list on papers about combinatorial multi-armed bandit problems.☆17Updated 3 years ago
- Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.☆20Updated 3 years ago
- ☆85Updated 3 months ago
- Official implementation of Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning at ICML…☆37Updated 3 years ago
- ☆15Updated last year
- "Hierarchical Reinforcement Learning for Integrated Recommendation" (AAAI 2021) https://ojs.aaai.org/index.php/AAAI/article/view/16580☆53Updated 3 years ago
- Explore the potential of recommendation system using reinforcement learning☆15Updated 4 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆33Updated 4 years ago
- ☆30Updated 4 years ago
- Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"☆33Updated 5 years ago
- rushhan / Generative-Adversarial-User-Model-for-Reinforcement-Learning-Based-Recommendation-System-Pytorch☆40Updated last year
- ☆22Updated 3 years ago
- This is the official implementation for COSMOS: a method to learn Pareto fronts that scales to large datasets and deep models.☆36Updated 3 years ago
- A TensorFlow implementation of SOFA, the Simulator for OFfline LeArning and evaluation.☆20Updated 3 years ago
- Thompson Sampling Tutorial☆51Updated 5 years ago
- (ICML2022) Off-Policy Evaluation for Large Action Spaces via Embeddings☆20Updated 2 years ago
- Code for Policy Learning for Fairness in Ranking paper at NeurIPS 2019☆20Updated 2 years ago