PyTorch port and extension of the Deep Bayesian Bandits Library
☆43Sep 4, 2019Updated 6 years ago
Alternatives and similar repositories for pytorch-deep-bayesian-bandits
Users that are interested in pytorch-deep-bayesian-bandits are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networkst"☆14Sep 12, 2022Updated 3 years ago
- ☆106Sep 13, 2021Updated 4 years ago
- Coherent Soft Imitation Learning☆23Jul 30, 2024Updated last year
- A Pytorch implementation of "Deep Learning with Logged Bandit Feedback"☆10Aug 22, 2018Updated 7 years ago
- CIKM 2022: SVD-GCN: A Simplified Graph Convolution Paradigm for Recommendation☆21Oct 10, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances☆12Aug 14, 2022Updated 3 years ago
- Source code for our paper "Joint Policy-Value Learning for Recommendation" published at KDD 2020.☆23Jul 6, 2023Updated 2 years ago
- Study NeuralUCB and regret analysis for contextual bandit with neural decision☆103Dec 14, 2021Updated 4 years ago
- Python tools for solving data-constrained finite element problems☆13Nov 9, 2021Updated 4 years ago
- Longformer for MS MARCO document re-ranking task.☆20Jan 11, 2021Updated 5 years ago
- Dataset of spoken conversational search utterances☆14Aug 27, 2021Updated 4 years ago
- Hyperparameter search for AllenNLP - powered by Ray TUNE☆28Mar 6, 2025Updated last year
- Code for the ICCV 2021 paper "Augmented Lagrangian Adversarial Attacks"☆24Mar 28, 2024Updated 2 years ago
- MinScIE is an Open Information Extraction system which provides structured knowledge enriched with semantic information about citations.☆15Jun 9, 2019Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs —without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Deep learning models for contextual multi-armed bandit setting☆13May 16, 2021Updated 5 years ago
- ☆14Aug 16, 2022Updated 3 years ago
- Example lecture for Constraint Satisfaction Problems in an interactive jupyter notebook. With python code to solve CSPs, with visualizati…☆15Dec 7, 2018Updated 7 years ago
- ☆20Oct 19, 2022Updated 3 years ago
- Provides a minimal implementation to extract FLAN datasets for further processing☆11Feb 1, 2023Updated 3 years ago
- Code for "Learning to Generate Reviews and Discovering Sentiment"☆15Nov 7, 2017Updated 8 years ago
- ☆11Oct 15, 2020Updated 5 years ago
- c++ mosestokenizer☆18Mar 13, 2024Updated 2 years ago
- A library containing a collection of distance and similarity measures for data analysis☆16Mar 25, 2026Updated 2 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Accelerated Confergence for Counterfactual Learning to Rank☆17Jan 21, 2022Updated 4 years ago
- Model Primitive Hierarchical Reinforcement Learning☆13Dec 8, 2022Updated 3 years ago
- Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.☆24Feb 16, 2023Updated 3 years ago
- Source code for SIGIR 2022 paper.☆16Apr 25, 2022Updated 4 years ago
- ☆16May 31, 2017Updated 9 years ago
- Implementation of different Relative Entropy Policy Search flavors☆13Nov 15, 2021Updated 4 years ago
- ☆45Apr 22, 2025Updated last year
- ☆36Jun 12, 2023Updated 3 years ago
- Source code for paper Choromanska et al. -- Beyond Backprop: Online Alternating Minimization with Auxiliary Variables -- http://proceedin…☆24Oct 29, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Jul 6, 2023Updated 2 years ago
- Repository for SIGIR'18 paper: "Ranking for Relevance and Display Preferences in Complex Presentation Layouts"☆16Aug 28, 2018Updated 7 years ago
- ☆17May 25, 2023Updated 3 years ago
- A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.☆71Jun 4, 2021Updated 5 years ago
- A tracer to generate sequence diagrams from running Python programs.☆16Feb 5, 2019Updated 7 years ago
- Predict and recommend the news articles, user is most likely to click in real time.☆32Apr 3, 2018Updated 8 years ago
- [AAMAS'26] xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing☆26Jan 8, 2026Updated 5 months ago