jiaqima/Off-Policy-2-Stage

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jiaqima/Off-Policy-2-Stage)

jiaqima / Off-Policy-2-Stage

Off-policy Learning in Two-stage Recommender Systems. https://dl.acm.org/doi/pdf/10.1145/3366423.3380130

☆30

Alternatives and similar repositories for Off-Policy-2-Stage

Users that are interested in Off-Policy-2-Stage are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LihangLiu / Generator-Evaluator
View on GitHub
☆12Jun 17, 2019Updated 7 years ago
antoine-hochart / bandit_algo_evaluation
View on GitHub
Offline evaluation of multi-armed bandit algorithms
☆23Dec 1, 2020Updated 5 years ago
husnejahan / Multi-armed-bandits-for-dynamic-movie-recommendations
View on GitHub
Multi-armed bandits for dynamic movie recommendations
☆14Nov 20, 2019Updated 6 years ago
criteo-research / blob
View on GitHub
Source code for our paper "BLOB: a probabilistic model for recommendation that combines organic and bandit signals" published at KDD 2020…
☆16Mar 24, 2023Updated 3 years ago
spotify-research / RIPS_KDD2020
View on GitHub
☆19Sep 9, 2020Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
collinprather / SlateQ
View on GitHub
A comparison of Google SlateQ algorithm with traditional Reinforcement Learning algorithms
☆38Dec 27, 2022Updated 3 years ago
adith387 / slates_semisynth_expts
View on GitHub
Semi-synthetic experiments to test several approaches for off-policy evaluation and optimization of slate recommenders.
☆43Nov 2, 2017Updated 8 years ago
criteo-research / bandit-reco
View on GitHub
☆51Jan 3, 2021Updated 5 years ago
AaronJi / RL
View on GitHub
A set of RL experiments. Currently including: (1) the MDP rank experiment, based on policy gradient algorithm
☆27Feb 7, 2022Updated 4 years ago
georgehc / mnar_mc
View on GitHub
☆12Nov 2, 2021Updated 4 years ago
gzn00417 / DH-GEM
View on GitHub
KDD'22 ''Talent Demand-Supply Joint Prediction with Dynamic Heterogeneous Graph Enhanced Meta-Learning''
☆14Feb 21, 2023Updated 3 years ago
hasteck / Higher_Recsys_2021
View on GitHub
☆25Aug 25, 2021Updated 4 years ago
XueyingBai / Model-Based-Reinforcement-Learning-for-Online-Recommendation
View on GitHub
A pytorch implementation of A Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation.
☆40Nov 26, 2019Updated 6 years ago
wuliwei9278 / SSE
View on GitHub
Partial Codes and datasets for NeurIPS'19 "Stochastic Shared Embeddings: Data-driven Regularization of Embedding Layers"
☆20Nov 1, 2019Updated 6 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
usaito / asymmetric-tri-rec-real
View on GitHub
(SIGIR2020) “Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback’’
☆21Nov 21, 2022Updated 3 years ago
wangshaonan / Associative-multichannel-autoencoder
View on GitHub
code for EMNLP2018 paper 'Associative-multichannel-autoencoder for multimodal word representation'
☆13Aug 24, 2018Updated 7 years ago
olivierjeunen / ease-side-info-recsys-2020
View on GitHub
Source code for our LBR paper "Closed-Form Models for Collaborative Filtering with Side-Information" published at RecSys 2020.
☆15Jul 22, 2021Updated 5 years ago
StatsDLMathsRecomSys / Adversarial-Counterfactual-Learning-and-Evaluation-for-Recommender-System
View on GitHub
☆22Jan 14, 2021Updated 5 years ago
modriczhang / HRL-Rec
View on GitHub
"Hierarchical Reinforcement Learning for Integrated Recommendation" (AAAI 2021) https://ojs.aaai.org/index.php/AAAI/article/view/16580
☆58Sep 12, 2021Updated 4 years ago
aiueola / wsdm2022-cascade-dr
View on GitHub
(WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"
☆13Jul 16, 2023Updated 3 years ago
BestActionNow / Slate_Aware_Ranking
View on GitHub
The implementation for our paper "Slate-Aware Ranking for Recommendation" accepted by WSDM.23
☆16Dec 13, 2022Updated 3 years ago
ardaegeunlu / X-armed-Bandits
View on GitHub
Implementation of the X-armed Bandits algorithm, as detailed in the paper, "X-armed Bandits", Bubeck et al., 2011.
☆11Jul 12, 2018Updated 8 years ago
Meesho / llm_calculator
View on GitHub
☆13Jul 17, 2026Updated last week
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
aaai17 / geo_teaser
View on GitHub
☆10Sep 17, 2016Updated 9 years ago
awarebayes / RecNN
View on GitHub
Reinforced Recommendation toolkit built around pytorch 1.7
☆589Dec 8, 2020Updated 5 years ago
HarrieO / 2021-SIGIR-plackett-luce
View on GitHub
☆32Jul 4, 2022Updated 4 years ago
layer6ai-labs / TAFA
View on GitHub
Code for the RecSys'20 paper "TAFA: Two-headed Attention Fused Autoencoder for Context-Aware Recommendations"
☆19Aug 15, 2020Updated 5 years ago
BetsyHJ / SOFA
View on GitHub
A TensorFlow implementation of SOFA, the Simulator for OFfline LeArning and evaluation.
☆21Nov 29, 2020Updated 5 years ago
yiqiwang8177 / Official-codebase-for-Decision-Transducer
View on GitHub
This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…
☆11Oct 9, 2023Updated 2 years ago
CharlieMat / PivotCVAE
View on GitHub
This is the implementation code for the WWW2021 paper "Variation Control and Evaluation for Generative Slate Recommendation"
☆15Jun 7, 2021Updated 5 years ago
xkianteb / ApproPO
View on GitHub
Reinforcement Learning with Convex Constraints
☆14Apr 6, 2022Updated 4 years ago
cryptedp / unsupervised_videos_pytorch
View on GitHub
Implementation of the paper Unsupervised Learning of Video Representations using LSTMs
☆10Nov 24, 2017Updated 8 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
nlpaueb / BioIR
View on GitHub
Using Centroids of Word Embeddings and Word Mover's Distance for Biomedical Document Retrieval in Question Answering.
☆15Jul 13, 2017Updated 9 years ago
BartyzalRadek / contextual-bandits-recommender
View on GitHub
Implementing LinUCB and HybridLinUCB in Python.
☆49May 15, 2018Updated 8 years ago
finn-no / recsys_slates_dataset
View on GitHub
FINN.no Slate Dataset for Recommender Systems. A dataset containing all interactions (viewed items + response (clicked item / no click) f…
☆54Jan 29, 2023Updated 3 years ago
rec-agent / drr
View on GitHub
code for the paper "Personalized Context-Aware Re-ranking for E-commerce Recommendation Systems"
☆52Jan 23, 2019Updated 7 years ago
rahmanidashti / STACP
View on GitHub
Joint Geographical and Temporal Modeling based on Matrix Factorization for Point-of-Interest Recommendation - ECIR 2020
☆24May 29, 2021Updated 5 years ago
ecom-research / CRM-LTR
View on GitHub
Mend Your Learning Approach, Not the Data for Ranking E-Commerce Products
☆23Mar 31, 2020Updated 6 years ago
berkeley-reclab / RecLab
View on GitHub
☆67Feb 16, 2023Updated 3 years ago