Feng-Hong / WINO-DLLM
☆12Updated this week
Alternatives and similar repositories for WINO-DLLM
Users who are interested in WINO-DLLM are comparing it to the repositories listed below
- [ICML 2024] PyTorch implementation for "Diversified Batch Selection for Training Acceleration"☆10Updated last year
- [ICLR 2025] The official PyTorch implementation of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Cont…☆49Updated 8 months ago
- Official code for paper: Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs.☆45Updated last month
- Code release for VTW (AAAI 2025 Oral)☆48Updated 3 weeks ago
- ☆49Updated 8 months ago
- A paper list of some recent works on token compression for ViT and VLM☆602Updated last week
- Up-to-date curated list of state-of-the-art research work, papers & resources on hallucination in large vision-language models☆154Updated 2 weeks ago
- Papers about Hallucination in Multi-Modal Large Language Models (MLLMs)☆94Updated 8 months ago
- ☆103Updated last month
- [NeurIPS'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …☆364Updated 7 months ago
- Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in…☆805Updated 3 weeks ago
- A paper list about Token Merge, Reduce, Resample, Drop for MLLMs.☆69Updated 7 months ago
- ☆31Updated last month
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆219Updated 8 months ago
- [arXiv 2025] Efficient Reasoning Models: A Survey☆248Updated this week
- Official code for CVPR 2024 paper, "SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models"☆16Updated last year
- [ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"☆92Updated 8 months ago
- A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg…☆114Updated 2 weeks ago
- A paper list on LLMs and multimodal LLMs☆42Updated last month
- [ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference".☆138Updated 2 months ago
- [ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'☆243Updated 3 months ago
- Instruction Tuning in Continual Learning paradigm☆55Updated 6 months ago
- Latest Advances on Modality Priors in Multimodal Large Language Models☆22Updated 3 weeks ago
- ☆95Updated 4 months ago
- Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model☆31Updated 7 months ago
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.☆497Updated this week
- Paper list for Efficient Reasoning.☆586Updated this week
- ☆27Updated 11 months ago
- [CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding☆302Updated 10 months ago
- a brief repo about paper research☆15Updated 11 months ago