[CVPR25 Highlight] A ChatGPT-Prompted Visual Hallucination Evaluation Dataset, featuring over 100,000 data samples and four advanced evaluation modes. The dataset includes extensive contextual descriptions, counterintuitive images, and clear indicators of hallucination items.
☆32 · Apr 16, 2025 · Updated last year
Alternatives and similar repositories for PhD
Users who are interested in PhD are comparing it to the repositories listed below.
- [CVPR 2025] PyTorch implementation of Diff-II ☆27 · Feb 27, 2025 · Updated last year
- Curated list of awesome LMM hallucinations papers, methods & resources. ☆149 · Mar 23, 2024 · Updated 2 years ago
- [CVPR 2024] TeachCLIP for Text-to-Video Retrieval ☆42 · May 7, 2025 · Updated 11 months ago
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration ☆16 · Mar 18, 2026 · Updated last month
- Source code for EMNLP 2022 paper "Finding Skill Neurons in Pre-trained Transformers via Prompt Tuning". ☆18 · Mar 13, 2023 · Updated 3 years ago
- [ICLR '25] Official PyTorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations" ☆102 · Nov 30, 2025 · Updated 5 months ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering ☆110 · Nov 23, 2024 · Updated last year
- The implementation of Decoupling Layout from Glyph in Online Chinese Handwriting Generation (ICLR 2025) ☆24 · May 26, 2025 · Updated 11 months ago
- TIFS2022: Decision-based Adversarial Attack with Frequency Mixup ☆22 · Aug 8, 2023 · Updated 2 years ago
- STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding? ☆39 · Jan 12, 2026 · Updated 3 months ago
- VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models ☆79 · Jul 13, 2024 · Updated last year
- Codebase for LLM Textual Hallucination Benchmark ☆79 · Apr 25, 2025 · Updated last year
- Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?" ☆15 · Jan 25, 2024 · Updated 2 years ago
- Code of the Grounded MUIE model, REAMO ☆10 · Dec 3, 2024 · Updated last year
- ☆10 · Oct 21, 2024 · Updated last year
- ☆10 · Jan 19, 2022 · Updated 4 years ago
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key ☆109 · Jan 9, 2026 · Updated 3 months ago
- [ACL 2025 Findings] Official PyTorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vis… ☆25 · Jul 21, 2024 · Updated last year
- [CVPR 2025] VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification ☆50 · Mar 24, 2025 · Updated last year
- [AAAI 2025] ConVis: Contrastive Decoding with Hallucination Visualization for Mitigating Hallucinations in Multimodal Large Language Mode… ☆26 · Sep 26, 2024 · Updated last year
- ☆12 · Sep 22, 2021 · Updated 4 years ago
- [EMNLP'25] A novel alignment framework that leverages image retrieval to mitigate hallucinations in Vision Language Models. ☆50 · Aug 21, 2025 · Updated 8 months ago
- We're Not Using Videos Effectively (TMLR 2024) ☆17 · Feb 4, 2024 · Updated 2 years ago
- [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(… ☆336 · Oct 14, 2025 · Updated 6 months ago
- ✨✨ The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio ☆54 · Jul 11, 2025 · Updated 9 months ago
- [ICCV 2025] VisRL: Intention-Driven Visual Perception via Reinforced Reasoning ☆46 · Nov 8, 2025 · Updated 5 months ago
- Official Code for Teacher Assistant-Based Knowledge Distillation Extracting Multi-level Features on Single Channel Sleep EEG (IJCAI 2023) ☆11 · Nov 4, 2023 · Updated 2 years ago
- (NeXD @ CVPR 2025) Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models ☆28 · Sep 30, 2025 · Updated 7 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation ☆142 · Sep 11, 2025 · Updated 7 months ago
- Code for CVPR 2019 paper ☆12 · Apr 26, 2019 · Updated 7 years ago
- ☆15 · Aug 30, 2025 · Updated 8 months ago
- ☆16 · Jun 14, 2024 · Updated last year
- W2VV++: A fully deep learning solution for ad-hoc video search ☆29 · Jul 25, 2024 · Updated last year
- Extended Inductive Reasoning for Personalized Preference Inference from Behavioral Signals ☆11 · Jan 8, 2026 · Updated 3 months ago
- [ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs ☆168 · Nov 6, 2024 · Updated last year
- Continual Online Recalibration with Pseudo-labels ☆14 · Jun 20, 2024 · Updated last year
- Codebase for VidHal: Benchmarking Hallucinations in Vision LLMs ☆14 · Apr 23, 2026 · Updated last week
- This repository contains the code for our paper "Augmenting Black-box LLMs with Medical Textbooks for Clinical Question Answering" [EMNLP… ☆15 · Oct 8, 2024 · Updated last year
- [AAAI 2026 Oral] HiMo-CLIP: Modeling Semantic Hierarchy and Monotonicity in Vision-Language Alignment ☆31 · Dec 17, 2025 · Updated 4 months ago