A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Speech-Language Models (SLMs), and more.
☆78Feb 7, 2026Updated 2 months ago
Alternatives and similar repositories for Awesome-Multi-Token-Prediction
Users that are interested in Awesome-Multi-Token-Prediction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [MM 2025] Towards Modality Generalization: A Benchmark and Prospective Analysis☆28May 22, 2025Updated 10 months ago
- [NeurIPS 2025] L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models☆25Oct 29, 2025Updated 5 months ago
- [KDD 2025] Fine-tuning Multimodal Large Language Models for Product Bundling☆15Sep 20, 2025Updated 6 months ago
- The repository of paper Personalized Multimodal Response Generation with Large Language Models☆18Jun 28, 2024Updated last year
- A curated list of Vision (video/image) to Audio Generation☆105Feb 10, 2026Updated 2 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- FlexiTokens☆19Dec 27, 2025Updated 3 months ago
- [ICML 2025] DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization☆20May 24, 2025Updated 10 months ago
- ☆21Apr 3, 2026Updated 2 weeks ago
- The official github repo for MixEval-X, the first any-to-any, real-world benchmark.☆17Feb 15, 2025Updated last year
- Fork of Flame repo for training of some new stuff in development☆19Updated this week
- The implementation of paper "EliMRec: Eliminating single-modal bias in multimedia recommendation", MM'22.☆22Dec 7, 2023Updated 2 years ago
- SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis☆69Jul 24, 2025Updated 8 months ago
- Diffusion Models for Generative Outfit Recommendation☆38Sep 11, 2024Updated last year
- the code for paper: A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering☆13Aug 22, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [KDD'25] LLM2Rec: Large Language Models Are Powerful Embedding Models for Sequential Recommendation.☆62Sep 6, 2025Updated 7 months ago
- [NeurIPS 2024] The implementation of paper "On Softmax Direct Preference Optimization for Recommendation"☆99Nov 29, 2024Updated last year
- A project for tri-modal LLM benchmarking and instruction tuning.☆57Mar 27, 2025Updated last year
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…☆10Feb 7, 2026Updated 2 months ago
- BPE modification that implements removing of the intermediate tokens during tokenizer training.☆27Nov 25, 2024Updated last year
- Harmonic-NAS: Hardware-Aware Multimodal Neural Architecture Search on Resource-constrained Devices (ACML 2023)☆16May 7, 2024Updated last year
- A mod to export a BeamNG.drive replay as a sequence of glTF files☆12Jan 23, 2026Updated 2 months ago
- ☆14Jun 24, 2024Updated last year
- Implementation for "DeltaPhi: Learning Physical Trajectory Residual for PDE Solving"☆13Jun 17, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Implementation of our paper, Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination..☆20Dec 3, 2023Updated 2 years ago
- Offical implementation of our paper "Exploring the Potential of Diffusion Large Language Models in Code Generation".☆20Oct 29, 2025Updated 5 months ago
- The implementation of paper "Strategy-aware Bundle Recommender System", SIGIR'23.☆15Sep 4, 2023Updated 2 years ago
- An Tensorflow.keras implementation of Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorizatio…☆10Dec 18, 2019Updated 6 years ago
- ☆17May 25, 2023Updated 2 years ago
- [TCSVT23] Official code for "SPT: Spatial Pyramid Transformer for Image Captioning".☆10Aug 14, 2024Updated last year
- ☆16Jul 2, 2022Updated 3 years ago
- Order-agnostic Identifier for Large Language Model-based Generative Recommendation (SIGIR'25)☆29Oct 21, 2025Updated 5 months ago
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆12Dec 27, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Explore, Establish, Exploit: Red Teaming Language Models from Scratch☆14Jun 21, 2023Updated 2 years ago
- [ICLR 2026] The implementation of paper "AlphaSteer: Learning Refusal Steering with Principled Null-Space Constraint"☆53Nov 20, 2025Updated 4 months ago
- The implementation of paper "Self-supervised learning for multimedia recommendation", TMM'22.☆10Jul 4, 2022Updated 3 years ago
- PyTorch code for full quantization of DNN using BCGD☆14Jul 24, 2019Updated 6 years ago
- Code and dataset release for "PACS: A Dataset for Physical Audiovisual CommonSense Reasoning" (ECCV 2022)☆17Dec 20, 2022Updated 3 years ago
- [CVPR 2024] Official repository of ST_GT☆10Sep 15, 2024Updated last year
- Tools for training causal language models for Finnish☆27Jan 14, 2026Updated 3 months ago