Xiaohao-Liu / Awesome-Multi-Token-Prediction
A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Speech-Language Models (SLMs), and more.
☆43 · Feb 7, 2026 · Updated last week
Alternatives and similar repositories for Awesome-Multi-Token-Prediction
Users interested in Awesome-Multi-Token-Prediction are comparing it to the repositories listed below:
- [NeurIPS 2025] An official source code for paper "L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models" ☆22 · Oct 29, 2025 · Updated 3 months ago
- [MM 2025] Towards Modality Generalization: A Benchmark and Prospective Analysis ☆28 · May 22, 2025 · Updated 8 months ago
- Improving large language models with concept-aware fine-tuning (CAFT) ☆29 · Jan 31, 2026 · Updated 2 weeks ago
- The repository of paper Personalized Multimodal Response Generation with Large Language Models ☆17 · Jun 28, 2024 · Updated last year
- A curated list of Vision (video/image) to Audio Generation ☆98 · Feb 10, 2026 · Updated last week
- Implementation of our paper, "MM-Forecast: A Multimodal Approach to Temporal Event Forecasting with Large Language Models". ☆18 · Apr 16, 2025 · Updated 10 months ago
- [ICML 2025] DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization ☆20 · May 24, 2025 · Updated 8 months ago
- The implementation of paper "EliMRec: Eliminating single-modal bias in multimedia recommendation", MM'22. ☆22 · Dec 7, 2023 · Updated 2 years ago
- [KDD'25] LLM2Rec: Large Language Models Are Powerful Embedding Models for Sequential Recommendation. ☆57 · Sep 6, 2025 · Updated 5 months ago
- Diffusion Models for Generative Outfit Recommendation ☆37 · Sep 11, 2024 · Updated last year
- [NeurIPS 2024] The implementation of paper "On Softmax Direct Preference Optimization for Recommendation" ☆96 · Nov 29, 2024 · Updated last year
- Data-efficient Fine-tuning for LLM-based Recommendation (SIGIR'24) ☆39 · Feb 21, 2025 · Updated 11 months ago
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference … ☆14 · Dec 12, 2024 · Updated last year
- [CVPR 2025] Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head Generation ☆19 · Dec 18, 2025 · Updated last month
- Implementation of "Interleaved Latent Visual Reasoning with Selective Perceptual Modeling". ☆43 · Jan 21, 2026 · Updated 3 weeks ago
- SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis ☆68 · Jul 24, 2025 · Updated 6 months ago
- The implementation of paper "Self-supervised learning for multimedia recommendation", TMM'22. ☆10 · Jul 4, 2022 · Updated 3 years ago
- ☆15 · Jan 25, 2025 · Updated last year
- [CVPR 2024] Official repository of ST_GT ☆10 · Sep 15, 2024 · Updated last year
- A minimal PyTorch implementation of BERT (Bidirectional Encoder Representations from Transformers) ☆11 · Mar 20, 2023 · Updated 2 years ago
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models. ☆10 · May 16, 2024 · Updated last year
- ☆10 · May 23, 2022 · Updated 3 years ago
- Regularly Truncated M-estimators for Learning with Noisy Labels ☆11 · Apr 24, 2024 · Updated last year
- (IJCAI 2023) Sph2Pob: Boosting Object Detection on Spherical Images with Planar Oriented Boxes Methods ☆13 · Aug 23, 2023 · Updated 2 years ago
- ☆11 · Jan 12, 2023 · Updated 3 years ago
- [TCSVT23] Official code for "SPT: Spatial Pyramid Transformer for Image Captioning". ☆10 · Aug 14, 2024 · Updated last year
- [2025 ACL Findings] Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization ☆25 · Oct 29, 2025 · Updated 3 months ago
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter". ☆12 · Dec 27, 2023 · Updated 2 years ago
- Official Implementation for Oriented Object Detection via Contextual Dependence Mining and Penalty-Incentive Allocation ☆10 · Dec 20, 2023 · Updated 2 years ago
- ☆12 · Feb 26, 2025 · Updated 11 months ago
- Order-agnostic Identifier for Large Language Model-based Generative Recommendation (SIGIR'25) ☆25 · Oct 21, 2025 · Updated 3 months ago
- ☆10 · Jun 22, 2022 · Updated 3 years ago
- Implementation for "DeltaPhi: Learning Physical Trajectory Residual for PDE Solving" ☆13 · Jun 17, 2024 · Updated last year
- Localization of Knowledge in Text-to-Image Models ☆12 · Oct 8, 2024 · Updated last year
- Chinese notes of SplaTam (3DGS-based SLAM) ☆15 · Feb 23, 2025 · Updated 11 months ago
- Cornell Tech CS5670 Introduction to Computer Vision Projects Repo ☆13 · Nov 22, 2022 · Updated 3 years ago
- ☆17 · May 25, 2023 · Updated 2 years ago
- ☆17 · Nov 11, 2024 · Updated last year
- This repository hosts the source code for the paper "ROCODE: Integrating Backtracking Mechanism and Program Analysis in Large Language Mo… ☆16 · Dec 16, 2025 · Updated 2 months ago