moaradwan / deep-learning-contextual-bandits
Deep learning models for contextual multi-armed bandit setting
☆12Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for deep-learning-contextual-bandits
- Code for the paper "Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference"☆37Updated 4 months ago
- ☆29Updated 2 years ago
- ☆13Updated 4 months ago
- ☆25Updated last year
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 2 months ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆26Updated 2 years ago
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…☆11Updated last year
- Minimal code for A Generalist Agent☆36Updated 2 years ago
- This repository contains code for the paper "Scalable Cross-Entropy Loss for Sequential Recommendations with Large Item Catalogs", RecSys…☆17Updated 2 weeks ago
- ☆17Updated 5 months ago
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances☆12Updated 2 years ago
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆36Updated last month
- ☆25Updated 3 weeks ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated last year
- PyTorch Package For Quasimetric Learning☆42Updated 3 weeks ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆45Updated 5 months ago
- Building blocks for productive research☆45Updated last week
- ☆29Updated 2 years ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆23Updated 3 weeks ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆32Updated last year
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC☆52Updated last year
- "Probabilistic Embeddings Revisited" paper official repository☆25Updated last year
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"☆34Updated last year
- ☆20Updated 11 months ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆21Updated 3 weeks ago
- ☆29Updated 2 months ago
- Automatically generate simple meta-learning tasks from a very large space☆15Updated last year
- (ICML2022) Off-Policy Evaluation for Large Action Spaces via Embeddings☆20Updated 2 years ago
- This is code to accompany the paper "Accelerating Exploration with Unlabeled Prior Data".☆20Updated 11 months ago
- Generalised UDRL☆37Updated 2 years ago