moaradwan / deep-learning-contextual-bandits
Deep learning models for contextual multi-armed bandit setting
☆12Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for deep-learning-contextual-bandits
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆26Updated 2 years ago
- ☆12Updated 3 months ago
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…☆11Updated last year
- This repository contains code for the paper "Scalable Cross-Entropy Loss for Sequential Recommendations with Large Item Catalogs", RecSys…☆16Updated last week
- Automatically generate simple meta-learning tasks from a very large space☆15Updated last year
- ☆25Updated last year
- PyTorch Package For Quasimetric Learning☆42Updated last week
- ☆17Updated 4 months ago
- Code for the paper "Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference"☆36Updated 4 months ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆41Updated last year
- Generalised UDRL☆37Updated 2 years ago
- (ICML2022) Off-Policy Evaluation for Large Action Spaces via Embeddings☆20Updated 2 years ago
- Dateset Reset Policy Optimization☆28Updated 7 months ago
- ☆23Updated 7 months ago
- "Probabilistic Embeddings Revisited" paper official repository☆25Updated last year
- ☆25Updated 2 weeks ago
- ☆29Updated 2 years ago
- ☆19Updated 11 months ago
- ☆24Updated 6 months ago
- ☆29Updated 2 years ago
- ☆14Updated last month
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- Implementation of Decision Stacks: Flexible RL via Modular Generative Models☆12Updated last year
- Neuroevolution Benchmark in JAX 🦕☆36Updated last year
- flexible meta-learning in jax☆12Updated last year
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆23Updated 4 months ago
- Building blocks for productive research☆44Updated 2 weeks ago
- ☆14Updated 9 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆45Updated 5 months ago
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆19Updated 5 months ago