[CVPR25 Highlight] A ChatGPT-Prompted Visual hallucination Evaluation Dataset, featuring over 100,000 data samples and four advanced evaluation modes. The dataset includes extensive contextual descriptions, counterintuitive images, and clear indicators of hallucination items.
β31Apr 16, 2025Updated 10 months ago
Alternatives and similar repositories for PhD
Users that are interested in PhD are comparing it to the libraries listed below
Sorting:
- π curated list of awesome LMM hallucinations papers, methods & resources.β150Mar 23, 2024Updated last year
- [CVPR 2025] PyTorch implementation of Diff-IIβ24Feb 27, 2025Updated last year
- Source code for EMNLP2022 paper "Finding Skill Neurons in Pre-trained Transformers via Prompt Tuning".β18Mar 13, 2023Updated 2 years ago
- [ICLR '25] Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations"β95Nov 30, 2025Updated 3 months ago
- STI-Bench : Are MLLMs Ready for Precise Spatial-Temporal World Understanding?β37Jan 12, 2026Updated last month
- [CVPR 2024] TeachCLIP for Text-to-Video Retrievalβ42May 7, 2025Updated 9 months ago
- VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Modelsβ77Jul 13, 2024Updated last year
- ViewSpatial-Bench:Evaluating Multi-perspective Spatial Localization in Vision-Language Modelsβ70Jan 8, 2026Updated last month
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steeringβ103Nov 23, 2024Updated last year
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Keyβ105Jan 9, 2026Updated last month
- [CVPR 2025] Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attβ¦β63Oct 9, 2025Updated 4 months ago
- Code for paper: Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projectionβ51Mar 13, 2025Updated 11 months ago
- XL-VLMs: General Repository for eXplainable Large Vision Language Modelsβ46Sep 8, 2025Updated 5 months ago
- β10Oct 21, 2024Updated last year
- [ICCV 2025] VisRL: Intention-Driven Visual Perception via Reinforced Reasoningβ45Nov 8, 2025Updated 3 months ago
- [CVPR 2025] VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsificationβ49Mar 24, 2025Updated 11 months ago
- π A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).β983Sep 27, 2025Updated 5 months ago
- β¨β¨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audioβ52Jul 11, 2025Updated 7 months ago
- [EMNLP'25] A novel alignment framework that leverages image retrieval to mitigate hallucinations in Vision Language Models.β50Aug 21, 2025Updated 6 months ago
- β13Jul 22, 2022Updated 3 years ago
- β13Jun 5, 2023Updated 2 years ago
- πOfficial code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".β47Mar 18, 2025Updated 11 months ago
- Code and data for the paper "Steering Conversational Large Language Models for Long Emotional Support Conversations" along with a UI to vβ¦β14Apr 14, 2025Updated 10 months ago
- Empowering Small VLMs to Think with Dynamic Memorization and Explorationβ15Nov 18, 2025Updated 3 months ago
- Efficient Cross-modality Graph Reasoning for RGB-Infrared Person Re-identificationβ10Sep 19, 2021Updated 4 years ago
- Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactualsβ12May 24, 2024Updated last year
- Extended Inductive Reasoning for Personalized Preference Inference from Behavioral Signalsβ11Jan 8, 2026Updated last month
- β18Aug 7, 2025Updated 6 months ago
- The official implementation of the paper SAEdit: Token-level control for continuous image editing via Sparse AutoEncoderβ18Oct 19, 2025Updated 4 months ago
- β16Jun 14, 2024Updated last year
- In OLHWDB ,you can find the ptts files, this code can help you get the information of the pttsβ11Mar 8, 2022Updated 3 years ago
- Official Code for Teacher Assistant-Based Knowledge Distillation Extracting Multi-level Features on Single Channel Sleep EEG (IJCAI 2023)β11Nov 4, 2023Updated 2 years ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inferenceβ10Dec 15, 2024Updated last year
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMsβ23Sep 21, 2025Updated 5 months ago
- β10Jan 19, 2022Updated 4 years ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigationβ135Sep 11, 2025Updated 5 months ago
- β28Dec 4, 2025Updated 2 months ago
- An undergraduate thesis project.β11Jul 13, 2024Updated last year
- Code of the Grounded MUIE model, REAMOβ11Dec 3, 2024Updated last year