FaithScore: Fine-grained Evaluations of Hallucinations in Large Vision-Language Models
☆33Nov 27, 2025Updated 7 months ago
Alternatives and similar repositories for FAITHSCORE
Users that are interested in FAITHSCORE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- HallE-Control: Controlling Object Hallucination in LMMs☆32Apr 10, 2024Updated 2 years ago
- Some papers about *diverse* image (a few videos) captioning☆25Apr 4, 2023Updated 3 years ago
- The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''☆263Aug 21, 2025Updated 10 months ago
- ☆18Aug 1, 2024Updated last year
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆57Oct 28, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- An automatic MLLM hallucination detection framework☆19Sep 26, 2023Updated 2 years ago
- An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation☆169Jan 15, 2024Updated 2 years ago
- Mitigating Open-Vocabulary Caption Hallucinations (EMNLP 2024)☆18Oct 18, 2024Updated last year
- GeckoNum Benchmark for T2I Model Eval.☆15Dec 5, 2024Updated last year
- The Official Code Repo for EgoOrientBench [CVPR25]☆17Nov 24, 2025Updated 7 months ago
- [NAACL 2024] Vision language model that reduces hallucinations through self-feedback guided revision. Visualizes attentions on image feat…☆49Aug 21, 2024Updated last year
- ☆42May 12, 2025Updated last year
- [Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning☆93Apr 30, 2024Updated 2 years ago
- Code and data for NAACL 2025 paper "IHEval: Evaluating Language Models on Following the Instruction Hierarchy"☆17Feb 25, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This repository is unmaintained, please see lumo for details.☆10Mar 19, 2023Updated 3 years ago
- MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria☆76Oct 16, 2024Updated last year
- ☆97Mar 29, 2019Updated 7 years ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models☆87Oct 26, 2025Updated 8 months ago
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆68May 31, 2024Updated 2 years ago
- Concept Learning Dynamics☆16Oct 29, 2024Updated last year
- ☆17Jul 23, 2025Updated 11 months ago
- An Easy-to-use Hallucination Detection Framework for LLMs.☆64Apr 21, 2024Updated 2 years ago
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)☆108Dec 9, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model☆47Nov 10, 2024Updated last year
- An Enhanced CLIP Framework for Learning with Synthetic Captions☆40Apr 18, 2025Updated last year
- [ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer☆37Oct 18, 2023Updated 2 years ago
- Code for the paper "Controllable Video Captioning with an Exemplar Sentence"☆12Apr 14, 2021Updated 5 years ago
- ☆12Oct 2, 2020Updated 5 years ago
- TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models☆40Nov 10, 2024Updated last year
- Balancing the Picture: Debiasing Vision-Language Datasets with Synthetic Contrast Sets☆12May 25, 2023Updated 3 years ago
- [ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"☆115Dec 4, 2024Updated last year
- Training code for CLIP-FlanT5☆31Jul 29, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [ACL'26 Findings] Official code for "BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search"☆30Apr 23, 2026Updated 2 months ago
- Can 3D Vision-Language Models Truly Understand Natural Language?☆20Mar 28, 2024Updated 2 years ago
- ☆19Dec 6, 2023Updated 2 years ago
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".☆71Feb 28, 2024Updated 2 years ago
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆17Apr 4, 2024Updated 2 years ago
- Merging Generated and Retrieved Knowledge for Open-Domain QA (EMNLP 2023)☆21Oct 8, 2023Updated 2 years ago
- [ACL 2025 Findings] Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vis…☆25Jul 21, 2024Updated last year