(ICLR'26 + Netflix) Rank-GRPO: Training LLM-based Conversational Recommender Systems with Reinforcement Learning
☆37Nov 17, 2025Updated 3 months ago
Alternatives and similar repositories for Rank-GRPO
Users that are interested in Rank-GRPO are comparing it to the libraries listed below
Sorting:
- ☆51Aug 6, 2025Updated 6 months ago
- The code for the paper *The Sensitivity of Counterfactual Fairness to Unmeasured Confounding* @ UAI 2019☆14Apr 4, 2020Updated 5 years ago
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆10Dec 19, 2024Updated last year
- ☆10May 19, 2025Updated 9 months ago
- AI-native knowledge kernel for human/agent collaboration. Use it as a Knowledge Base, Wiki, Annotator, Research Tool, or Agentic Memory.☆29Updated this week
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆23Nov 13, 2025Updated 3 months ago
- Software to enable data-rich collaboration from high-resolution display walls to your laptop☆16Feb 19, 2026Updated last week
- Data-efficient Fine-tuning for LLM-based Recommendation (SIGIR'24)☆39Feb 21, 2025Updated last year
- Auction Theory Toolbox – Computer Verified Auctions☆14Jul 12, 2016Updated 9 years ago
- A list of companies focusing on geospatial intelligence, GIS, RS, Climate risks, and more☆20Jul 29, 2025Updated 7 months ago
- ☆13Oct 21, 2024Updated last year
- ☆31Feb 3, 2026Updated 3 weeks ago
- Fast, free, easy, and object-agnostic video anonymization☆11Dec 12, 2020Updated 5 years ago
- Benchmark evaluating ocean forecasting systems against reference datasets and observations.☆24Feb 20, 2026Updated last week
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆28Dec 30, 2025Updated 2 months ago
- ☆24Dec 19, 2025Updated 2 months ago
- MCP server for Grok AI API integration☆19Jun 2, 2025Updated 8 months ago
- [TMLR 2025] A general framework for bridging LLMs and recommendation systems via reinforcement learning. https://arxiv.org/pdf/2503.24289☆127Jan 28, 2026Updated 3 weeks ago
- This is the source code for Efficient Sequential Recommendation for Long Term User Interest Via Personalization.☆22Nov 18, 2025Updated 3 months ago
- ☆37Oct 29, 2025Updated 3 months ago
- Jupyter notebooks from our weekly (or so) hackathons☆11Dec 3, 2024Updated last year
- Microsoft Graph CLI - Mail, Calendar, OneDrive, To-Do, Contacts☆48Jan 26, 2026Updated last month
- Learning to Skip the Middle Layers of Transformers☆17Aug 7, 2025Updated 6 months ago
- Struct-aware fuzzing framework + some fuzzers☆30Jan 28, 2026Updated 3 weeks ago
- ☆19Dec 1, 2025Updated 2 months ago
- Reinforcing Text-Rich Video Reasoning with Visual Rumination☆27Nov 24, 2025Updated 3 months ago
- Code for Semi-crowdsourced Clustering with Deep Generative Models☆12Dec 9, 2022Updated 3 years ago
- ☆14Apr 29, 2024Updated last year
- Code for the publication "Appa: Bending Weather Dynamics with Latent Diffusion Models for Global Data Assimilation".☆24Dec 4, 2025Updated 2 months ago
- Langchain-powered natural language interface to knowledge-graphs.☆17Nov 3, 2025Updated 3 months ago
- ☆12Nov 5, 2024Updated last year
- Metadata browser of TREC☆10Feb 20, 2026Updated last week
- ☆12Oct 28, 2024Updated last year
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- Run GEPA on your favorite non-python libraries.☆33Jan 22, 2026Updated last month
- This repository contains the source code for a confluence context server, it provides prompts that can be used as slash commands for clie…☆11Jan 24, 2025Updated last year
- The open-source language model computer☆10Mar 22, 2024Updated last year
- Intelligent memory system for OpenWebUI with semantic retrieval, LLM consolidation, and adaptive context injection☆44Dec 2, 2025Updated 2 months ago
- Python library for solving reinforcement learning (RL) problems using generative models.☆11Feb 18, 2025Updated last year