(ICLR'26 + Netflix) Rank-GRPO: Training LLM-based Conversational Recommender Systems with Reinforcement Learning
☆39Nov 17, 2025Updated 4 months ago
Alternatives and similar repositories for Rank-GRPO
Users who are interested in Rank-GRPO are comparing it to the libraries listed below
- ☆52Aug 6, 2025Updated 7 months ago
- ☆12Jun 19, 2024Updated last year
- The official pytorch implementation of our proposed model MISSL (ICDE-24).☆13Dec 8, 2023Updated 2 years ago
- (WWW'24 + LinkedIn) The first RS that tightly combines LLM with ID-based RS☆173Aug 7, 2024Updated last year
- [TMLR 2025] A general framework for bridging LLMs and recommendation systems via reinforcement learning. https://arxiv.org/pdf/2503.24289☆129Jan 28, 2026Updated last month
- Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions☆10May 16, 2018Updated 7 years ago
- A framework for steering MoE models by detecting and controlling behavior-linked experts.☆30Sep 12, 2025Updated 6 months ago
- ☆24Jan 19, 2026Updated 2 months ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- Jupyter notebooks from our weekly (or so) hackathons☆11Dec 3, 2024Updated last year
- Code for "Exponential Family Estimation via Adversarial Dynamics Embedding" (NeurIPS 2019)☆14Nov 26, 2019Updated 6 years ago
- Minimum Description Length probing for neural network representations☆20Jan 28, 2025Updated last year
- The official repo of the paper titled DeH4R: A Decoupled and Hybrid Method for Road Network Graph Extraction.☆22Dec 1, 2025Updated 3 months ago
- [ICCV 2025] The official PyTorch implementation of "LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs".☆22Oct 28, 2025Updated 4 months ago
- The official implementation of Hard Negative Sampling via Large Language Models for Recommendation.☆11Jan 17, 2026Updated 2 months ago
- This is an official pytorch implementation for paper "Temporal-Frequency Co-training for Time Series Semi-supervised Learning" (AAAI-23)…☆15May 17, 2024Updated last year
- ☆13Nov 21, 2025Updated 3 months ago
- ☆16Apr 26, 2023Updated 2 years ago
- Code for Semi-crowdsourced Clustering with Deep Generative Models☆12Dec 9, 2022Updated 3 years ago
- Code accompanying VarGrad: A Low-Variance Gradient Estimator for Variational Inference☆12Oct 12, 2020Updated 5 years ago
- ICLR2023 - Tailoring Language Generation Models under Total Variation Distance☆21Feb 8, 2023Updated 3 years ago
- ICTNet: a novel network for semantic segmentation with the underlying architecture of a fully convolutional network, infused with feature…☆10May 27, 2020Updated 5 years ago
- ☆10May 19, 2025Updated 10 months ago
- ☆11Oct 21, 2017Updated 8 years ago
- (WWW'20) Official code for the paper "multimodal deep variational information bottleneck for micro-video popularity prediction".☆46Dec 9, 2021Updated 4 years ago
- Official implementation of the RSE paper mKGR.☆20Jan 15, 2026Updated 2 months ago
- A marker-based augmented reality camera app for the web, powered by AR.js.☆16Dec 4, 2022Updated 3 years ago
- Earth system foundation model data, training, and eval☆156Updated this week
- PyTorch utilities for ML, specifically speech☆13Jan 30, 2024Updated 2 years ago
- Facilitating learning, using, and designing graph processing pipelines/models systematically.☆27May 21, 2022Updated 3 years ago
- ☆12Oct 28, 2024Updated last year
- An automatic Gaussian process classifier.☆13May 28, 2016Updated 9 years ago
- Repository for the paper "Unsupervised Representation Learning of Spatial Data via Multimodal Embedding"☆12Dec 5, 2019Updated 6 years ago
- [ICLR 2026 Oral] Generative Universal Verifier as Multimodal Meta-Reasoner☆54Nov 14, 2025Updated 4 months ago
- ☆11Feb 5, 2024Updated 2 years ago
- Conditional neural process: translation of the TensorFlow code to PyTorch☆17Apr 7, 2020Updated 5 years ago
- Implementation of NAACL'25 "Empowering Retrieval-based Conversational Recommendation with Contrasting User Preferences"☆14Sep 9, 2025Updated 6 months ago
- ☆11Aug 1, 2024Updated last year
- ☆12Feb 16, 2024Updated 2 years ago