inFaaa / EvolverLinks
[COLING 2025π₯] Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection
β16Updated last year
Alternatives and similar repositories for Evolver
Users that are interested in Evolver are comparing it to the libraries listed below
Sorting:
- [EMNLP 2024 Findingsπ₯] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inβ¦β104Updated last year
- Official Code and data for ACL 2024 finding, "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models"β25Updated last year
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.β85Updated last year
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!β54Updated 10 months ago
- β33Updated 8 months ago
- MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiencyβ136Updated 6 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuningβ91Updated 11 months ago
- β88Updated last year
- A Self-Training Framework for Vision-Language Reasoningβ88Updated last year
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation inβ¦β171Updated 4 months ago
- Code release for VTW (AAAI 2025 Oral)β64Updated 3 months ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Taskβ36Updated 9 months ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)β57Updated last year
- β64Updated 2 weeks ago
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)β45Updated 7 months ago
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?β38Updated 7 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigationβ134Updated 5 months ago
- Co-Reinforcement Learning for Unified Multimodal Understanding and Generationβ39Updated 6 months ago
- β¨β¨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audioβ52Updated 7 months ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoningβ70Updated 6 months ago
- This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning".β84Updated 7 months ago
- β36Updated 3 weeks ago
- β136Updated 2 months ago
- VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Modelsβ77Updated last year
- Code for DeCo: Decoupling token compression from semanchc abstraction in multimodal large language modelsβ77Updated 6 months ago
- [EMNLP'23] The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''β105Updated 5 months ago
- [ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language modelβ47Updated last year
- A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.β71Updated 10 months ago
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentationβ104Updated 4 months ago
- Official implement of MIA-DPOβ70Updated last year