Xiaohao-Liu / Awesome-Multi-Token-Prediction
A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Speech-Language Models (SLMs), and more.
☆33 Updated last week
Alternatives and similar repositories for Awesome-Multi-Token-Prediction
Users who are interested in Awesome-Multi-Token-Prediction are comparing it to the repositories listed below
- [MM 2025] Towards Modality Generalization: A Benchmark and Prospective Analysis ☆28 Updated 8 months ago
- ☆56 Updated last year
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in… ☆185 Updated 4 months ago
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning! ☆53 Updated 10 months ago
- ☆152 Updated 8 months ago
- Code release for VTW (AAAI 2025 Oral) ☆64 Updated 2 months ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering ☆102 Updated last year
- Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025) ☆65 Updated 9 months ago
- ☆155 Updated 11 months ago
- [CVPR' 25] Interleaved-Modal Chain-of-Thought ☆104 Updated 3 weeks ago
- [NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models ☆74 Updated 7 months ago
- ☆112 Updated 4 months ago
- This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attrib… ☆32 Updated 6 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI ☆240 Updated 3 months ago
- Survey on Data-centric Large Language Models ☆88 Updated last year
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of… ☆76 Updated 7 months ago
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation ☆104 Updated 4 months ago
- A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models. ☆71 Updated 10 months ago
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key ☆102 Updated 2 weeks ago
- A paper list about Token Merge, Reduce, Resample, Drop for MLLMs. ☆80 Updated 3 months ago
- A curated list of Awesome Personalized Large Multimodal Models resources ☆50 Updated 2 weeks ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation ☆130 Updated 4 months ago
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning ☆234 Updated last year
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper) ☆385 Updated 3 months ago
- [AAAI 2026] Global Compression Commander: Plug-and-Play Inference Acceleration for High-Resolution Large Vision-Language Models ☆37 Updated last month
- (ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception. ☆46 Updated 6 months ago
- Official Code and data for ACL 2024 finding, "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models" ☆24 Updated last year
- Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning" ☆110 Updated last month
- MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency ☆137 Updated 5 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation" ☆93 Updated last year