chenyuxin1999 / S-DPO
[NeurIPS 2024] The implementation of paper "On Softmax Direct Preference Optimization for Recommendation"
☆96Nov 29, 2024Updated last year
Alternatives and similar repositories for S-DPO
Users that are interested in S-DPO are comparing it to the libraries listed below
- ☆160Jul 12, 2024Updated last year
- [ICLR 2025 Oral 🏆] The implementation of paper "Language Representations Can be What Recommenders Need: Findings and Potentials"☆97May 16, 2025Updated 9 months ago
- [KDD 2025] The implementation of "Fine-tuning Multimodal Large Language Models for Product Bundling", KDD'25☆15Sep 20, 2025Updated 4 months ago
- Official code of "Invariant Collaborative Filtering to Popularity Distribution Shift" (2023 WWW)☆21Jul 27, 2023Updated 2 years ago
- The implementation of paper "EliMRec: Eliminating single-modal bias in multimedia recommendation", MM'22.☆22Dec 7, 2023Updated 2 years ago
- [SIGIR 2025] implementation of AlphaFuse: Learn ID Embeddings for Sequential Recommendation in Null Space of Language Embeddings☆38Apr 15, 2025Updated 10 months ago
- ☆24Nov 16, 2023Updated 2 years ago
- [MM 2025] Towards Modality Generalization: A Benchmark and Prospective Analysis☆28May 22, 2025Updated 8 months ago
- [ICDE'24] Code of "Adapting Large Language Models by Integrating Collaborative Semantics for Recommendation."☆196Sep 9, 2024Updated last year
- ☆19Sep 5, 2024Updated last year
- Official implementation for "ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation"☆21Jan 31, 2026Updated 2 weeks ago
- [TMLR 2025] A general framework for bridging LLMs and recommendation systems via reinforcement learning. https://arxiv.org/pdf/2503.24289☆127Jan 28, 2026Updated 2 weeks ago
- [NeurIPS 2023] The implementation of paper "Empowering Collaborative Filtering Generalization via Principled Adversarial Contrastive Loss…☆20Feb 21, 2024Updated last year
- ☆72Sep 1, 2024Updated last year
- [EMNLP 2024] Enhancing High-order Interaction Awareness in LLM-based Recommender Model.☆13Jan 9, 2025Updated last year
- Data-efficient Fine-tuning for LLM-based Recommendation (SIGIR'24)☆39Feb 21, 2025Updated 11 months ago
- The latest paper list of large language model (LLM) for recommendation☆58Jan 30, 2024Updated 2 years ago
- ☆15Feb 26, 2025Updated 11 months ago
- Learnable Item Tokenization for Generative Recommendation (Most Cited Paper at CIKM'24)☆132Jan 1, 2025Updated last year
- A unified, extensible, and reproducible benchmark for collaborative filtering (CF) research.☆24Jun 7, 2025Updated 8 months ago
- A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Sp…☆43Feb 7, 2026Updated last week
- ☆279Feb 5, 2024Updated 2 years ago
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆50Oct 23, 2024Updated last year
- [ICLR 2026] The implementation of paper "AlphaSteer: Learning Refusal Steering with Principled Null-Space Constraint"☆39Nov 20, 2025Updated 2 months ago
- [SIGIR 2024 perspective] The implementation of paper "On Generative Agents in Recommendation"☆458Jul 7, 2024Updated last year
- ☆395Apr 1, 2025Updated 10 months ago
- ☆25Sep 7, 2025Updated 5 months ago
- [ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"☆29Jan 10, 2026Updated last month
- [ICLR 2025] The implementation of paper "Preference Diffusion for Recommendation"☆23Apr 21, 2025Updated 9 months ago
- [NeurIPS25] Official repo for "Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning"☆41Oct 3, 2025Updated 4 months ago
- [ACL 2024] ReactXT: Understanding Molecular “Reaction-ship” via Reaction-Contextualized Molecule-Text Pretraining. by Zhiyuan Liu*, Yaoru…☆27Sep 3, 2024Updated last year
- [CIKM 2023 Oral] This is the code repo for our CIKM'23 paper "Text Matching Improves Sequential Recommendation by Reducing Popularity Bia…☆39Mar 17, 2024Updated last year
- SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis☆68Jul 24, 2025Updated 6 months ago
- ☆12Jun 19, 2024Updated last year
- ☆10Jul 8, 2021Updated 4 years ago
- Code used in ACL rebuttal☆31Sep 3, 2024Updated last year
- ☆14Jun 18, 2024Updated last year
- [2025 ACL Findings] Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization☆25Oct 29, 2025Updated 3 months ago
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…☆10Feb 7, 2026Updated last week