[CVPR25 Highlight] A ChatGPT-Prompted Visual hallucination Evaluation Dataset, featuring over 100,000 data samples and four advanced evaluation modes. The dataset includes extensive contextual descriptions, counterintuitive images, and clear indicators of hallucination items.
☆32 · Apr 16, 2025 · Updated last year
Alternatives and similar repositories for PhD
Users that are interested in PhD are comparing it to the libraries listed below.
- [ICCV 2023] GeoFormer for Homography Estimation ☆35 · Dec 25, 2023 · Updated 2 years ago
- [CVPR 2025] PyTorch implementation of Diff-II ☆27 · Feb 27, 2025 · Updated last year
- 😎 curated list of awesome LMM hallucinations papers, methods & resources. ☆150 · Mar 23, 2024 · Updated 2 years ago
- This repository offers a comprehensive overview of existing datasets and methods in the field of change captioning. ☆17 · Sep 2, 2025 · Updated 9 months ago
- In OLHWDB, you can find the ptts files; this code helps you extract the information they contain. ☆11 · Mar 8, 2022 · Updated 4 years ago
- Source code for EMNLP 2022 paper "Finding Skill Neurons in Pre-trained Transformers via Prompt Tuning". ☆18 · Mar 13, 2023 · Updated 3 years ago
- [ICLR '25] Official PyTorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations" ☆103 · Nov 30, 2025 · Updated 6 months ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering ☆112 · Nov 23, 2024 · Updated last year
- STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding? ☆38 · Jan 12, 2026 · Updated 4 months ago
- VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models ☆79 · Jul 13, 2024 · Updated last year
- 📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM). ☆1,021 · Sep 27, 2025 · Updated 8 months ago
- Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?" ☆15 · Jan 25, 2024 · Updated 2 years ago
- Code of the Grounded MUIE model, REAMO ☆10 · Dec 3, 2024 · Updated last year
- An LLM leaderboard for stateful agents ☆21 · Oct 16, 2025 · Updated 7 months ago
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key ☆110 · Jan 9, 2026 · Updated 5 months ago
- [ACL 2025 Findings] Official PyTorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vis… ☆25 · Jul 21, 2024 · Updated last year
- [CVPR 2025] VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification ☆50 · Mar 24, 2025 · Updated last year
- [AAAI 2025] ConVis: Contrastive Decoding with Hallucination Visualization for Mitigating Hallucinations in Multimodal Large Language Mode… ☆26 · Sep 26, 2024 · Updated last year
- [EMNLP'25] A novel alignment framework that leverages image retrieval to mitigate hallucinations in Vision Language Models.