(ICLR'26 + Netflix) Rank-GRPO: Training LLM-based Conversational Recommender Systems with Reinforcement Learning
☆51May 23, 2026Updated 2 weeks ago
Alternatives and similar repositories for Rank-GRPO
Users that are interested in Rank-GRPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2025] The implementation of paper "Preference Diffusion for Recommendation"☆28Apr 21, 2025Updated last year
- ☆12Jun 19, 2024Updated last year
- The official pytorch implementation of our proposed model MISSL (ICDE-24).☆13Dec 8, 2023Updated 2 years ago
- (WWW'24 + LinkedIn) The first RS that tightly combines LLM with ID-based RS☆173Aug 7, 2024Updated last year
- Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions☆10May 16, 2018Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Learning to Skip the Middle Layers of Transformers☆17Aug 7, 2025Updated 10 months ago
- [ACL'26 Findings] Official code for "BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search"☆30Apr 23, 2026Updated last month
- Code for "Exponential Family Estimation via Adversarial Dynamics Embedding" (NeurIPS 2019)☆14Nov 26, 2019Updated 6 years ago
- https://www.kaggle.com/c/nbme-score-clinical-patient-notes☆10Sep 1, 2022Updated 3 years ago
- ☆50Nov 24, 2024Updated last year
- ☆10Aug 14, 2020Updated 5 years ago
- The official repo of the paper titled DeH4R: A Decoupled and Hybrid Method for Road Network Graph Extraction.☆23May 25, 2026Updated 2 weeks ago
- The official implementation of Hard Negative Sampling via Large Language Models for Recommendation.☆11Jan 17, 2026Updated 4 months ago
- Landsat-Bench: Datasets and Benchmarks for Landsat Foundation Models☆19Jun 18, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆16Apr 26, 2023Updated 3 years ago
- Code for Semi-crowdsourced Clustering with Deep Generative Models☆12Dec 9, 2022Updated 3 years ago
- ICLR2023 - Tailoring Language Generation Models under Total Variation Distance☆21Feb 8, 2023Updated 3 years ago
- An Extensible Framework for Retrieval-Augmented LLM Applications: Learning Relevance Beyond Simple Similarity.☆41Dec 8, 2024Updated last year
- WWW 2023 "Mutual Wasserstein Discrepancy Minimization for Sequential Recommendation"☆16Jun 20, 2023Updated 2 years ago
- ICTNet: a novel network for semantic segmentation with the underlying architecture of a fully convolutional network, infused with feature…☆10May 27, 2020Updated 6 years ago
- ☆11Aug 31, 2024Updated last year
- [ICCV 2025] The official pytorch implement of "LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs".☆23Oct 28, 2025Updated 7 months ago
- ☆11Oct 21, 2017Updated 8 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- A framework for steering MoE models by detecting and controlling behavior-linked experts.☆35Sep 12, 2025Updated 8 months ago
- ☆11Jul 25, 2021Updated 4 years ago
- Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding (Findings of EMNLP'23)☆11Aug 24, 2024Updated last year
- (WWW'20) Official codes of paper "multimodal deep variational information bottleneck for micro-video popularity prediction".☆46Dec 9, 2021Updated 4 years ago
- This is an official pytorch implementation for paper "Temporal-Frequency Co-training for Time Series Semi-supervised Learning" (AAAI-23)…☆15May 17, 2024Updated 2 years ago
- Posterior with interesting shapes from actually used models☆13Feb 10, 2025Updated last year
- [RSE25] Official implementation of the paper mKGR.☆22May 17, 2026Updated 3 weeks ago
- Official implementation of "OpenCity3D: What do Vision-Language Models know about Urban Environments?" @ WACV2025☆18Nov 24, 2024Updated last year
- ☆33Jun 29, 2025Updated 11 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ICCV25] Official implementation of the paper HoliTracer.☆48Apr 7, 2026Updated 2 months ago
- Facilitating learning, using, and designing graph processing pipelines/models systematically.☆27May 21, 2022Updated 4 years ago
- ☆14Mar 2, 2023Updated 3 years ago
- ☆13Oct 25, 2024Updated last year
- [CIKM '24] Implementation of "Multi-Behavior Generative Recommendation"☆58Aug 24, 2024Updated last year
- An automatic Gaussian process classifier.☆13May 28, 2016Updated 10 years ago
- [SIGIR'2024] "SelfGNN: Self-Supervised Graph Neural Networks for Sequential Recommendation"☆73Jun 10, 2024Updated last year