yaochenzhu/Rank-GRPO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yaochenzhu/Rank-GRPO)

yaochenzhu / Rank-GRPO

(ICLR'26 + Netflix) Rank-GRPO: Training LLM-based Conversational Recommender Systems with Reinforcement Learning

☆53

Alternatives and similar repositories for Rank-GRPO

Users that are interested in Rank-GRPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yaochenzhu / LLM4Rec
View on GitHub
(WWW'24 + LinkedIn) The first RS that tightly combines LLM with ID-based RS
☆174Aug 7, 2024Updated last year
Code2Q / TagCF
View on GitHub
☆17Nov 6, 2025Updated 8 months ago
USTC-StarTeam / GE4Rec
View on GitHub
ICML 2025 | GE4Rec: supervised feature generation paradigm for CTR prediction models.
☆37Jun 10, 2026Updated last month
sober-clever / ReRe
View on GitHub
The implementations of paper "Reinforced Preference Optimization for Recommendation" (ReRe).
☆20Nov 16, 2025Updated 8 months ago
yaochenzhu / MMVED
View on GitHub
(WWW'20) Official codes of paper "multimodal deep variational information bottleneck for micro-video popularity prediction".
☆46Dec 9, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
agiresearch / DeSocial
View on GitHub
☆15Jan 19, 2026Updated 6 months ago
xuwenxinedu / R3
View on GitHub
☆30Apr 7, 2026Updated 3 months ago
YinhanHe123 / SemCoT
View on GitHub
Official implementation for NeurIPS 2025 paper "SemCoT: Accelerating Chain-of-Thought Reasoning through Semantically-Aligned Implicit Tok…
☆22Nov 7, 2025Updated 8 months ago
google-deepmind / action_piece
View on GitHub
☆66Jul 2, 2026Updated 2 weeks ago
Jamesding000 / MemGen-GR
View on GitHub
The code implementation for our KDD 2026 Oral paper "How Well Does Generative Recommendation Generalize?"
☆40Jun 2, 2026Updated last month
rutgerswiselab / MemRec
View on GitHub
MemRec
☆77Mar 17, 2026Updated 4 months ago
yale-nlp / Bright-Pro
View on GitHub
Data and code for ACL 2026 Paper "Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems…
☆18Apr 30, 2026Updated 2 months ago
liuqidong07 / LLM-ESR
View on GitHub
[NeurIPS'24 Spotlight] The official implementation code of LLM-ESR.
☆56Jun 20, 2026Updated last month
ZhixunLEE / FairGB
View on GitHub
[SIGKDD 2024] Rethinking Fair Graph Neural Networks from Re-balancing
☆10Jul 15, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
AxiomMath / lattice-triangle
View on GitHub
Lean formalizations for the paper "On the paucity of lattice triangles"
☆18Mar 26, 2026Updated 3 months ago
jordane95 / dual-cross-encoder
View on GitHub
Dual Cross Encoder for Dense Retrieval
☆18Mar 15, 2023Updated 3 years ago
ejhshen / SLIM
View on GitHub
Implementation of SLIM, a framework of dynamics skill lifecycle management for agentic reinforcement learning
☆22May 12, 2026Updated 2 months ago
JennyXieJiayi / HMMVED
View on GitHub
The implementation of HMMVED.
☆18Jul 20, 2022Updated 4 years ago
yaochenzhu / awesome-books-for-causality
View on GitHub
Books and posts to understand Pearl and Rubin's view on causality and their disputes.
☆13May 7, 2022Updated 4 years ago
callmespring / CausalRL
View on GitHub
Implementation of "Dynamic Causal Effects Evaluation in A/B Testing with a Reinforcement Learning Framework" (JASA 2023)
☆32Oct 19, 2023Updated 2 years ago
yewzz / EAGER
View on GitHub
☆35Jul 19, 2024Updated 2 years ago
zhaijianyang / MQL4GRec
View on GitHub
☆57Apr 1, 2025Updated last year
HappyPointer / LLM2Rec
View on GitHub
[KDD'25] LLM2Rec: Large Language Models Are Powerful Embedding Models for Sequential Recommendation.
☆68Sep 6, 2025Updated 10 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
yaochenzhu / VBAE
View on GitHub
(TKDE'22) Official codes of "Collaborative variational bandwidth auto-encoder (VBAE) for recommender systems".
☆18Jul 16, 2022Updated 4 years ago
Elvin-Yiming-Du / Memory-T1
View on GitHub
This respository is used for time reasoning task for mult-session dialogue system.
☆16Feb 7, 2026Updated 5 months ago
HKUDS / RecGPT
View on GitHub
[EMNLP2025] "RecGPT: A Foundation Model for Sequential Recommendation"
☆59Oct 14, 2025Updated 9 months ago
yaochenzhu / CRAG
View on GitHub
(WWW'25 + Netflix) The first CRS that retrieves collaborative filtering knowledge with two-step context-aware reflection.
☆21Sep 10, 2025Updated 10 months ago
hupeiyu21 / GenCDR
View on GitHub
☆24Feb 1, 2026Updated 5 months ago
facebookresearch / RPG_KDD2025
View on GitHub
This repository provides the code for implementing RPG described in our KDD'25 paper "Generating Long Semantic IDs in Parallel for Recomm…
☆140Jul 2, 2026Updated 2 weeks ago
KRLabsOrg / squeez
View on GitHub
Squeeze verbose LLM agent tool output down to only the relevant lines
☆20Apr 27, 2026Updated 2 months ago
lswhim / PreferDiff
View on GitHub
[ICLR 2025] The implementation of paper "Preference Diffusion for Recommendation"
☆28Apr 21, 2025Updated last year
BaohaoLiao / frac-cot
View on GitHub
[COLM 2026] An efficient 3D sampling method for long-CoT LLM.
☆16May 25, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
FengLiu-1 / FMRec
View on GitHub
☆21Feb 11, 2025Updated last year
WxxShirley / Agent-STAR
View on GitHub
Official implementation for paper "Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe"
☆32May 12, 2026Updated 2 months ago
HansiZeng / CL-DRD
View on GitHub
[SIGIR 2022] The official repo for the paper "Curriculum Learning for Dense Retrieval Distillation".
☆23Apr 29, 2022Updated 4 years ago
HeYueThu / CausPref
View on GitHub
Code of paper "CAUSPref: Causal Preference Learning for Out-of-Distribution Recommendation" (the WebConf22)
☆20Mar 11, 2022Updated 4 years ago
Ruiyang-061X / SketchThinker-R1
View on GitHub
[ICLR'26] SketchThinker-R1: Towards Efficient Sketch-Style Reasoning in Large Multimodal Models
☆17Mar 26, 2026Updated 3 months ago
chengang95 / UnKD
View on GitHub
☆15Jun 15, 2023Updated 3 years ago
luyi256 / ScaleGUN
View on GitHub
☆10Mar 23, 2025Updated last year