Stevetich / EventHallusionView external linksLinks
EventHallusion: Diagnosing Event Hallucinations in Video LLMs
☆34Aug 5, 2025Updated 6 months ago
Alternatives and similar repositories for EventHallusion
Users that are interested in EventHallusion are comparing it to the libraries listed below
Sorting:
- [ACM MM 2024] ReToMe-VA: Recursive Token Merging for Video Diffusion-based Unrestricted Adversarial Attack☆14Dec 20, 2024Updated last year
- [NeurIPS 2024] Lumen: a Large multimodal model with versatile vision-centric capabilities☆25Sep 27, 2024Updated last year
- ☆24Oct 28, 2024Updated last year
- VideoHallucer, The first comprehensive benchmark for hallucination detection in large video-language models (LVLMs)☆42Dec 16, 2025Updated last month
- [MM24 Oral] Identity-Driven Multimedia Forgery Detection via Reference Assistance☆119Jul 27, 2025Updated 6 months ago
- ☆18Jun 10, 2025Updated 8 months ago
- ☆32Jul 29, 2024Updated last year
- [ECCV 2022] MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes official implementation☆16Feb 2, 2023Updated 3 years ago
- Repository for the paper: Teaching VLMs to Localize Specific Objects from In-context Examples☆40Nov 27, 2024Updated last year
- Adversarial Examples Detection Benchmark☆17Dec 6, 2024Updated last year
- Codebase for VidHal: Benchmarking Hallucinations in Vision LLMs☆14Apr 19, 2025Updated 9 months ago
- [AAAI 26 Demo] Offical repo for CAT-V - Caption Anything in Video: Object-centric Dense Video Captioning with Spatiotemporal Multimodal P…☆64Jan 27, 2026Updated 2 weeks ago
- ☆20Jul 28, 2025Updated 6 months ago
- [EMNLP 2025 Industry] Datasets and Recipes for Video Temporal Grounding via Reinforcement Learning☆35Oct 22, 2025Updated 3 months ago
- [ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models☆20Jul 17, 2024Updated last year
- ☆21Jan 17, 2025Updated last year
- The reproduce of Transformer architecture in paper "Attention is all your need"☆18May 15, 2020Updated 5 years ago
- [ICLR 2025] BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks☆30Nov 2, 2025Updated 3 months ago
- Papers about Hallucination in Multi-Modal Large Language Models (MLLMs)☆101Nov 21, 2024Updated last year
- ☆160Jan 16, 2025Updated last year
- ☆58Aug 7, 2023Updated 2 years ago
- Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Capti…☆23Jan 26, 2025Updated last year
- ☆68Feb 5, 2026Updated last week
- Official repository of "Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach" (ACL 2024 Oral)☆34Mar 24, 2025Updated 10 months ago
- Official code for "Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models" (TCSVT'2023)☆29Dec 27, 2023Updated 2 years ago
- [ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, …☆128Apr 4, 2025Updated 10 months ago
- [2023 ACL] CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding☆31Aug 5, 2023Updated 2 years ago
- A reading list of papers about Visual Grounding.☆32Aug 24, 2022Updated 3 years ago
- [ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆186May 21, 2025Updated 8 months ago
- Open-source red teaming framework for MLLMs with 37+ attack methods☆221Jan 16, 2026Updated 3 weeks ago
- [AAAI 2025] Trusted Unified Feature-Neighborhood Dynamics for Multi-View Classification☆19Apr 17, 2025Updated 9 months ago
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model☆37Nov 27, 2024Updated last year
- ☆33Aug 19, 2023Updated 2 years ago
- LMM solved catastrophic forgetting, AAAI2025☆45Apr 15, 2025Updated 9 months ago
- Instituto de Telecomunicações Deep Learning-based Point Cloud Codec☆11Jun 18, 2024Updated last year
- ☆10Jul 6, 2021Updated 4 years ago
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i…☆23Nov 29, 2025Updated 2 months ago
- Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization☆100Jan 30, 2024Updated 2 years ago
- Official implementation of CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation☆12Dec 5, 2025Updated 2 months ago