[CVPR25 Highlight] A ChatGPT-Prompted Visual hallucination Evaluation Dataset, featuring over 100,000 data samples and four advanced evaluation modes. The dataset includes extensive contextual descriptions, counterintuitive images, and clear indicators of hallucination items.
β32Apr 16, 2025Updated last year
Alternatives and similar repositories for PhD
Users that are interested in PhD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICCV 2023] GeoFormer for Homography Estimationβ35Dec 25, 2023Updated 2 years ago
- π curated list of awesome LMM hallucinations papers, methods & resources.β150Mar 23, 2024Updated 2 years ago
- [CVPR 2024] TeachCLIP for Text-to-Video Retrievalβ42May 7, 2025Updated last year
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Explorationβ17Mar 18, 2026Updated 2 months ago
- This repository offers a comprehensive overview of existing datasets and methods in the field of change captioning.β17Sep 2, 2025Updated 8 months ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ICLR '25] Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations"β102Nov 30, 2025Updated 5 months ago
- β11Oct 9, 2021Updated 4 years ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steeringβ110Nov 23, 2024Updated last year
- The implementation of Decoupling Layout from Glyph in Online Chinese Handwriting Generation (ICLR 2025)β24May 26, 2025Updated 11 months ago
- TIFS2022: Decision-based Adversarial Attack with Frequency Mixupβ22Aug 8, 2023Updated 2 years ago
- β10May 16, 2025Updated last year
- Real-time Relevant Recommendation Suggestionβ16Jul 31, 2022Updated 3 years ago
- InternalandContextualAttentionNetworkforCold-startMulti-channel MatchinginRecommendationβ17Jun 14, 2021Updated 4 years ago
- VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Modelsβ79Jul 13, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided by Text Informationβ23Apr 13, 2025Updated last year
- Codebase for LLM Textual Hallucination Benchmarkβ80Apr 25, 2025Updated last year
- Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"β15Jan 25, 2024Updated 2 years ago
- Code of the Grounded MUIE model, REAMOβ10Dec 3, 2024Updated last year
- β10Oct 21, 2024Updated last year
- β10Jan 19, 2022Updated 4 years ago
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Keyβ110Jan 9, 2026Updated 4 months ago
- [ACL 2025 Findings] Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Visβ¦β25Jul 21, 2024Updated last year
- Pytorch implementation of Detectiveβ13Jul 11, 2024Updated last year
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [CVPR 2025] VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsificationβ50Mar 24, 2025Updated last year
- [AAAI 2025] ConVis: Contrastive Decoding with Hallucination Visualization for Mitigating Hallucinations in Multimodal Large Language Modeβ¦β26Sep 26, 2024Updated last year
- β12Sep 22, 2021Updated 4 years ago
- β14Dec 28, 2023Updated 2 years ago
- Code for paper: Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projectionβ61Mar 13, 2025Updated last year
- Identifying the compiler family, version and compiler flags that generated a binaryβ20Dec 19, 2019Updated 6 years ago
- This is the dataset for the competition "Clinical Brain Computer Interfaces Challenge" to be held at WCCI 2020 at Glasgow. There are the β¦β12Jan 20, 2022Updated 4 years ago
- Code for our TKDE paper "Understanding WeChat User Preferences and βWowβ Diffusion"β20Aug 29, 2024Updated last year
- We're Not Using Videos Effectively (TMLR 2024)β17Feb 4, 2024Updated 2 years ago
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(β¦β337Oct 14, 2025Updated 7 months ago
- β¨β¨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audioβ54Jul 11, 2025Updated 10 months ago
- [ICCV 2025] VisRL: Intention-Driven Visual Perception via Reinforced Reasoningβ46Nov 8, 2025Updated 6 months ago
- Official Code for Teacher Assistant-Based Knowledge Distillation Extracting Multi-level Features on Single Channel Sleep EEG (IJCAI 2023)β11Nov 4, 2023Updated 2 years ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigationβ144Sep 11, 2025Updated 8 months ago
- The official PyTorch code for AAAI'23 Paper "Sparse Coding in a Dual Memory System for Lifelong Learning"β12Feb 15, 2023Updated 3 years ago
- Code for CVPR 2019 paperβ12Apr 26, 2019Updated 7 years ago