[CVPR25 Highlight] A ChatGPT-Prompted Visual hallucination Evaluation Dataset, featuring over 100,000 data samples and four advanced evaluation modes. The dataset includes extensive contextual descriptions, counterintuitive images, and clear indicators of hallucination items.
☆32Apr 16, 2025Updated 11 months ago
Alternatives and similar repositories for PhD
Users that are interested in PhD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICCV 2023] GeoFormer for Homography Estimation☆35Dec 25, 2023Updated 2 years ago
- [CVPR 2025] PyTorch implementation of Diff-II☆27Feb 27, 2025Updated last year
- 😎 curated list of awesome LMM hallucinations papers, methods & resources.☆149Mar 23, 2024Updated 2 years ago
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆16Mar 18, 2026Updated 3 weeks ago
- This repository offers a comprehensive overview of existing datasets and methods in the field of change captioning.☆17Sep 2, 2025Updated 7 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [ICLR '25] Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations"☆102Nov 30, 2025Updated 4 months ago
- This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attrib…☆36Jul 14, 2025Updated 8 months ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering☆107Nov 23, 2024Updated last year
- ☆10May 16, 2025Updated 10 months ago
- STI-Bench : Are MLLMs Ready for Precise Spatial-Temporal World Understanding?☆38Jan 12, 2026Updated 2 months ago
- VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models☆78Jul 13, 2024Updated last year
- 📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).☆1,003Sep 27, 2025Updated 6 months ago
- Codebase for LLM Textual Hallucination Benchmark☆78Apr 25, 2025Updated 11 months ago
- Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided by Text Information☆23Apr 13, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Code of the Grounded MUIE model, REAMO☆11Dec 3, 2024Updated last year
- ☆10Oct 21, 2024Updated last year
- This is the dataset for the competition "Clinical Brain Computer Interfaces Challenge" to be held at WCCI 2020 at Glasgow. There are the …☆11Jan 20, 2022Updated 4 years ago
- [CVPR 2025] VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification☆49Mar 24, 2025Updated last year
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆107Jan 9, 2026Updated 3 months ago
- [ACL 2025 Findings] Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vis…☆25Jul 21, 2024Updated last year
- Pytorch implementation of Detective☆12Jul 11, 2024Updated last year
- ☆12Sep 22, 2021Updated 4 years ago
- [EMNLP'25] A novel alignment framework that leverages image retrieval to mitigate hallucinations in Vision Language Models.☆50Aug 21, 2025Updated 7 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code for paper: Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection☆55Mar 13, 2025Updated last year
- ☆13Dec 28, 2023Updated 2 years ago
- Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"