FaithScore: Fine-grained Evaluations of Hallucinations in Large Vision-Language Models
☆32 · Nov 27, 2025 · Updated 3 months ago
Alternatives and similar repositories for FAITHSCORE
Users who are interested in FAITHSCORE are comparing it to the repositories listed below.
- ☆17 · Aug 1, 2024 · Updated last year
- HallE-Control: Controlling Object Hallucination in LMMs ☆31 · Apr 10, 2024 · Updated last year
- An automatic MLLM hallucination detection framework ☆19 · Sep 26, 2023 · Updated 2 years ago
- The official GitHub page for "Evaluating Object Hallucination in Large Vision-Language Models" ☆248 · Aug 21, 2025 · Updated 6 months ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024) ☆57 · Oct 28, 2024 · Updated last year
- Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization ☆100 · Jan 30, 2024 · Updated 2 years ago
- [NAACL 2024] Vision language model that reduces hallucinations through self-feedback guided revision. Visualizes attentions on image feat… ☆47 · Aug 21, 2024 · Updated last year
- Some papers about *diverse* image (a few videos) captioning ☆26 · Apr 4, 2023 · Updated 2 years ago
- The Official Code Repo for EgoOrientBench [CVPR25] ☆14 · Nov 24, 2025 · Updated 3 months ago
- GeckoNum Benchmark for T2I Model Eval. ☆15 · Dec 5, 2024 · Updated last year
- Mitigating Open-Vocabulary Caption Hallucinations (EMNLP 2024) ☆18 · Oct 18, 2024 · Updated last year
- [Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning ☆91 · Apr 30, 2024 · Updated last year
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning". ☆70 · Feb 28, 2024 · Updated 2 years ago
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions ☆17 · Apr 4, 2024 · Updated last year
- Lightweight PDF Q&A tool powered by RAG (Retrieval-Augmented Generation) with MCP (Model Context Protocol) Support. ☆22 · Oct 27, 2025 · Updated 4 months ago
- [ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer ☆37 · Oct 18, 2023 · Updated 2 years ago
- ☆21 · Jun 27, 2024 · Updated last year
- ☆19 · Feb 21, 2024 · Updated 2 years ago
- [ACL 2025 Findings] Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vis… ☆24 · Jul 21, 2024 · Updated last year
- This is an official implementation of GRIT-VLP ☆20 · Aug 8, 2022 · Updated 3 years ago
- ☆19 · Dec 6, 2023 · Updated 2 years ago
- Learning visually grounded word embeddings using Abstract scenes ☆18 · Mar 1, 2019 · Updated 7 years ago
- ☆26 · Jul 13, 2023 · Updated 2 years ago
- MATE: Masked Autoencoders are Online 3D Test-Time Learners (ICCV 2023) ☆22 · Jul 22, 2023 · Updated 2 years ago
- Can 3D Vision-Language Models Truly Understand Natural Language? ☆20 · Mar 28, 2024 · Updated last year
- 【NeurIPS 2024】The official code of paper "Automated Multi-level Preference for MLLMs" ☆22 · Sep 26, 2024 · Updated last year
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models ☆86 · Oct 26, 2025 · Updated 4 months ago
- ☆93 · Mar 29, 2019 · Updated 6 years ago
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024) ☆104 · Dec 9, 2024 · Updated last year
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs ☆26 · Jan 14, 2025 · Updated last year
- This is the official code implementation for the AAAI 2025 paper ``Defeasible Visual Entailment: Benchmark, Evaluator, and Reward-Driven Optimizat… ☆22 · Mar 21, 2025 · Updated 11 months ago
- [ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model ☆47 · Nov 10, 2024 · Updated last year
- [ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding" ☆109 · Dec 4, 2024 · Updated last year
- ☆25 · Mar 26, 2024 · Updated last year
- ☆117 · Feb 11, 2025 · Updated last year
- [NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models ☆34 · Nov 13, 2024 · Updated last year
- Training code for CLIP-FlanT5 ☆30 · Jul 29, 2024 · Updated last year
- An Enhanced CLIP Framework for Learning with Synthetic Captions ☆40 · Apr 18, 2025 · Updated 10 months ago
- TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models ☆37 · Nov 10, 2024 · Updated last year