Stevetich / EventHallusion
EventHallusion: Diagnosing Event Hallucinations in Video LLMs
☆30Updated 3 weeks ago
Alternatives and similar repositories for EventHallusion
Users who are interested in EventHallusion are comparing it to the repositories listed below
- [NeurIPS 2024] Lumen: a Large multimodal model with versatile vision-centric capabilities☆24Updated 7 months ago
- ☆24Updated 6 months ago
- ☆21Updated 3 months ago
- MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆34Updated last month
- Envolving Temporal Reasoning Capability into LMMs via Temporal Consistent Reward☆35Updated last month
- The official repository for paper "PruneVid: Visual Token Pruning for Efficient Video Large Language Models".☆38Updated 2 months ago
- [ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models☆16Updated 9 months ago
- [ICLR 2025] TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning☆33Updated last month
- [CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection☆75Updated last month
- [CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention☆33Updated 9 months ago
- [CVPRW 2025] UniToken is an auto-regressive generation model that combines discrete and continuous representations to process visual inpu…☆81Updated 3 weeks ago
- [LLaVA-Video-R1]✨First Adaptation of R1 to LLaVA-Video (2025-03-18)☆27Updated this week
- 🎉 The code repository for "Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning" in PyTorc…☆11Updated 3 weeks ago
- Hyperbolic Safety-Aware Vision-Language Models. CVPR 2025☆15Updated last month
- [CVPR 2025] Official PyTorch code of "Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation".☆28Updated last week
- HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data (Accepted by CVPR 2024)☆45Updated 9 months ago
- [CVPR 2025] OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?☆56Updated last month
- [Blog 1] Recording a bug of grpo_trainer in some R1 projects☆20Updated 2 months ago
- ☆17Updated 5 months ago
- ☆83Updated last month
- VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models☆56Updated 10 months ago
- Official implementation of paper ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding☆33Updated last month
- ☆71Updated 5 months ago
- Official PyTorch code of GroundVQA (CVPR'24)☆60Updated 8 months ago
- ☆35Updated 10 months ago
- Official repo for CAT-V - Caption Anything in Video: Object-centric Dense Video Captioning with Spatiotemporal Multimodal Prompting☆37Updated 2 weeks ago
- ☆117Updated 3 months ago
- Official repository for CoMM Dataset☆33Updated 4 months ago
- p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay☆35Updated 4 months ago
- 【NeurIPS 2024】The official code of paper "Automated Multi-level Preference for MLLMs"☆19Updated 7 months ago