A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Speech-Language Models (SLMs), and more.
☆49Feb 7, 2026Updated last month
Alternatives and similar repositories for Awesome-Multi-Token-Prediction
Users that are interested in Awesome-Multi-Token-Prediction are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] An official source code for paper "L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models"☆23Oct 29, 2025Updated 4 months ago
- [KDD 2025] The implementation of "Fine-tuning Multimodal Large Language Models for Product Bundling", KDD'25☆15Sep 20, 2025Updated 5 months ago
- The repository of paper Personalized Multimodal Response Generation with Large Language Models☆17Jun 28, 2024Updated last year
- [ICML 2025] DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization☆20May 24, 2025Updated 9 months ago
- Official code of "Invariant Collaborative Filtering to Popularity Distribution Shift" (2023 WWW)☆21Jul 27, 2023Updated 2 years ago
- The implementation of paper "EliMRec: Eliminating single-modal bias in multimedia recommendation", MM'22.☆22Dec 7, 2023Updated 2 years ago
- [KDD'25] LLM2Rec: Large Language Models Are Powerful Embedding Models for Sequential Recommendation.☆58Sep 6, 2025Updated 6 months ago
- [NeurIPS 2024] The implementation of paper "On Softmax Direct Preference Optimization for Recommendation"☆96Nov 29, 2024Updated last year
- Offical implementation of our paper "Exploring the Potential of Diffusion Large Language Models in Code Generation".☆20Oct 29, 2025Updated 4 months ago
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated last year
- [TCSVT23] Official code for "SPT: Spatial Pyramid Transformer for Image Captioning".☆10Aug 14, 2024Updated last year
- [CVPR 2024] Official repository of ST_GT☆10Sep 15, 2024Updated last year
- ☆11Jan 12, 2023Updated 3 years ago
- A project for tri-modal LLM benchmarking and instruction tuning.☆57Mar 27, 2025Updated 11 months ago
- ☆15Jan 25, 2025Updated last year
- Pytorch implementation of deep fill v2 (original by Jiayu et al.)☆10Jun 26, 2019Updated 6 years ago
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…☆10Feb 7, 2026Updated last month
- Regularly Truncated M-estimators for Learning with Noisy Labels☆11Apr 24, 2024Updated last year
- [TGRS 2023] Official code for "EARL: An Elliptical Distribution aided Adaptive Label Assignment for Oriented Object Detection in Remote S…☆14Oct 11, 2023Updated 2 years ago
- Implementation of "Interleaved Latent Visual Reasoning with Selective Perceptual Modeling".☆45Jan 21, 2026Updated last month
- [2025 ACL Findings] Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization☆25Oct 29, 2025Updated 4 months ago
- ☆12Jun 15, 2023Updated 2 years ago
- ☆12Sep 12, 2024Updated last year
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆12Dec 27, 2023Updated 2 years ago
- Offical Implementation for Oriented Object Detection via Contextual Dependence Mining and Penalty-Incentive Allocation☆10Dec 20, 2023Updated 2 years ago
- The project page of paper: Aha! Adaptive History-driven Attack for Decision-based Black-box Models [ICCV 2021]☆10Feb 23, 2022Updated 4 years ago
- Cornell Tech CS5670 Introduction to Computer Vision Projects Repo☆13Nov 22, 2022Updated 3 years ago
- [ICCV 2025] Diffusion Curriculum (DisCL)☆18Sep 26, 2025Updated 5 months ago
- ☆10Jun 22, 2022Updated 3 years ago
- The official github repo for MixEval-X, the first any-to-any, real-world benchmark.☆16Feb 15, 2025Updated last year
- ☆12Feb 26, 2025Updated last year
- the code for paper: A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering☆13Aug 22, 2023Updated 2 years ago
- Order-agnostic Identifier for Large Language Model-based Generative Recommendation (SIGIR'25)☆25Oct 21, 2025Updated 4 months ago
- Implementation for "DeltaPhi: Learning Physical Trajectory Residual for PDE Solving"☆13Jun 17, 2024Updated last year
- PyTorch Implementation of "BOOTPLACE: Bootstrapped Object Placement with Detection Transformers", CVPR 2025☆24Aug 8, 2025Updated 7 months ago
- (ACL 2025) 🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct"☆20May 15, 2025Updated 9 months ago
- The official repos of "Knowledge Bridger: Towards Training-Free Missing Modality Completion"☆21Jun 30, 2025Updated 8 months ago
- Official source code for AAAI 2025 paper: CoRA: Collaborative Information Perception by Large Language Model's Weights for Recommendatio…☆17Dec 11, 2024Updated last year
- This is the official code for the ACL 2025 paper "GRAM: Generative Recommendation via Semantic-aware Multi-granular Late Fusion".☆27Aug 30, 2025Updated 6 months ago