Kamichanw / Speculative-Ensemble

[ICML'25] Official code of paper "Speculative Ensemble: Fast Large Language Model Ensemble via Speculation"

☆14

Alternatives and similar repositories for Speculative-Ensemble

Users that are interested in Speculative-Ensemble are comparing it to the libraries listed below

Sorting:

1zhou-Wang / MemVR
[ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…
☆53Updated this week
ShadeCloak / ADORA
☆43Updated last month
zhaochen0110 / OpenThinkIMG
OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.
☆50Updated this week
ECNU-ICALK / EduChat-Math
☆30Updated 6 months ago
OpenGVLab / V2PE
[ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding
☆46Updated 5 months ago
findalexli / mllm-dpo
[ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model
☆44Updated 6 months ago
horseee / CoT-Valve
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
☆67Updated 3 months ago
gyhdog99 / MoCLE
MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)
☆37Updated last year
pipilurj / bootstrapped-preference-optimization-BPO
code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"
☆55Updated 8 months ago
Quinn777 / AtomThink
Can Atomic Step Decomposition Enhance the Self-structured Reasoning of Multimodal Large Models?
☆24Updated 2 months ago
UCSC-VLAA / VLAA-Thinking
SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models
☆107Updated 3 weeks ago
SUSTechBruce / LOOK-M
[EMNLP 2024 Findings🔥] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…
☆93Updated 6 months ago
arctanxarc / MC-LLaVA
Official implementation of MC-LLaVA.
☆26Updated 3 months ago
vlf-silkie / VLFeedback
☆99Updated last year
CRIPAC-DIG / LogicCheckGPT
[ACL 2024] Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models. Detect and mitigate object hallucinatio…
☆20Updated 3 months ago
Osilly / dynamic_llava
[ICLR 2025] The official pytorch implement of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Cont…
☆36Updated 5 months ago
RainBowLuoCS / DEEM
(ICLR2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.
☆34Updated 2 months ago
Hongcheng-Gao / HAVEN
Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".
☆14Updated last month
The-Martyr / CausalMM
[ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality
☆27Updated last week
RupertLuo / VoCoT
VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models
☆56Updated 10 months ago
pengts / VW-LMM
☆25Updated last year
yfzhang114 / LLaVA-Align
This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strat…
☆78Updated 2 months ago
GATECH-EIC / ACT
[ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…
☆38Updated 10 months ago
lzhxmu / VTW
Code release for VTW (AAAI 2025) Oral
☆39Updated 3 months ago
simplelifetime / TIVE
Less is More: High-value Data Selection for Visual Instruction Tuning
☆12Updated 3 months ago
pengshuai-rin / MultiMath
MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models
☆28Updated 3 months ago
zwq2018 / Multi-modal-Self-instruct
The codebase for our EMNLP24 paper: Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Mo…
☆79Updated 3 months ago
Yuqifan1117 / HalluciDoctor
HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data (Accepted by CVPR 2024)
☆45Updated 10 months ago
YuxiXie / V-DPO
Preference Learning for LLaVA
☆44Updated 6 months ago
Kevinz-code / SeVa
[MM2024, oral] "Self-Supervised Visual Preference Alignment" https://arxiv.org/abs/2404.10501
☆55Updated 9 months ago