inFaaa / EvolverLinks
[COLING 2025π₯] Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection
β16Updated last year
Alternatives and similar repositories for Evolver
Users that are interested in Evolver are comparing it to the libraries listed below
Sorting:
- Official Code and data for ACL 2024 finding, "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models"β25Updated last year
- [EMNLP 2024 Findingsπ₯] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inβ¦β103Updated last year
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.β85Updated last year
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation inβ¦β186Updated 4 months ago
- β25Updated last year
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigationβ132Updated 4 months ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)β58Updated last year
- [EMNLP'23] The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''β104Updated 5 months ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Taskβ36Updated 9 months ago
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!β53Updated 10 months ago
- (ICLR 2026 π₯) Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs"β73Updated 4 months ago
- the official repo for EMNLP 2024 (main) paper "EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimoβ¦β20Updated 9 months ago
- [NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chainsβ71Updated 6 months ago
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)β45Updated 7 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuningβ89Updated 11 months ago
- [NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Modelsβ74Updated 8 months ago
- [ICCV 2025] The official code of the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rβ¦β109Updated 6 months ago
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentationβ104Updated 4 months ago
- Code release for VTW (AAAI 2025 Oral)β64Updated 2 months ago
- VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Modelsβ77Updated last year
- β88Updated last year
- Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)β88Updated 4 months ago
- Co-Reinforcement Learning for Unified Multimodal Understanding and Generationβ39Updated 6 months ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.β89Updated 11 months ago
- This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attribβ¦β32Updated 6 months ago
- β64Updated last week
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?β38Updated 7 months ago
- Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"β110Updated last month
- [MM2024, oral] "Self-Supervised Visual Preference Alignment" https://arxiv.org/abs/2404.10501β61Updated last year
- Doodling our way to AGI βοΈ πΌοΈ π§β120Updated 8 months ago