An Easy-to-use Hallucination Detection Framework for LLMs.
☆63Apr 21, 2024Updated last year
Alternatives and similar repositories for EasyDetect
Users that are interested in EasyDetect are comparing it to the libraries listed below
Sorting:
- [NLPCC 2024] Shared Task 10: Regulating Large Language Models☆14Jun 12, 2024Updated last year
- [ACL 2024] An Easy-to-use Hallucination Detection Framework for LLMs.☆40Feb 25, 2025Updated last year
- [ACL 2024] Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models. Detect and mitigate object hallucinatio…☆25Jan 31, 2025Updated last year
- Agentic Keyframe Search for Video Question Answering☆16Apr 7, 2025Updated 11 months ago
- The code repository for "Wings: Learning Multimodal LLMs without Text-only Forgetting" [NeurIPS 2024]☆27Dec 28, 2024Updated last year
- FaithScore: Fine-grained Evaluations of Hallucinations in Large Vision-Language Models☆33Nov 27, 2025Updated 3 months ago
- [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(…☆333Oct 14, 2025Updated 5 months ago
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆69May 31, 2024Updated last year
- ☆20Jun 16, 2025Updated 9 months ago
- [ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.☆409Dec 23, 2024Updated last year
- This is the official impletations of the EMNLP Findings paper, VideoINSTA: Zero-shot Long-Form Video Understanding via Informative Spatia…☆25Nov 15, 2024Updated last year
- [ACM TOMM] Official implementation of "TextCoT: Zoom-In for Enhanced Multimodal Text-Rich Image Understanding"☆44Feb 27, 2026Updated 3 weeks ago
- ☆20Feb 27, 2023Updated 3 years ago
- ☆21Oct 10, 2023Updated 2 years ago
- [CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection☆137Jul 28, 2025Updated 7 months ago
- ✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models☆650Dec 23, 2024Updated last year
- 📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).☆994Sep 27, 2025Updated 5 months ago
- ☆30Aug 21, 2025Updated 6 months ago
- An benchmark for evaluating the capabilities of large vision-language models (LVLMs)☆46Nov 17, 2023Updated 2 years ago
- [CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback☆307Sep 11, 2024Updated last year
- An experiment to see if we can process G2 reviews to extract topics from reviews☆10Feb 5, 2024Updated 2 years ago
- The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''☆253Aug 21, 2025Updated 6 months ago
- 🦩 Official repository of paper "Visual Instruction Tuning with Polite Flamingo" (AAAI-24 Oral)☆65Dec 9, 2023Updated 2 years ago
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆27Oct 3, 2025Updated 5 months ago
- Emergent Visual Grounding in Large Multimodal Models Without Grounding Supervision☆42Oct 19, 2025Updated 5 months ago
- OceanGym: A Benchmark Environment for Underwater Embodied Agents☆100Jan 29, 2026Updated last month
- A Survey on Benchmarks of Multimodal Large Language Models☆150Jul 1, 2025Updated 8 months ago
- Official codebase for the paper "Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space"☆69Dec 17, 2025Updated 3 months ago
- [ACL2025] Unsolvable Problem Detection: Robust Understanding Evaluation for Large Multimodal Models☆80Mar 6, 2026Updated 2 weeks ago
- Code for the paper "Data Attribution for Text-to-Image Models by Unlearning Synthesized Images."☆17May 23, 2025Updated 9 months ago
- [IJCAI 2021] Document-level Relation Extraction as Semantic Segmentation☆155Dec 6, 2022Updated 3 years ago
- 哈尔滨工业大学(深圳)2021年球季学期深度学习体系结构实验☆17Oct 1, 2022Updated 3 years ago
- ☆18Apr 7, 2025Updated 11 months ago
- ☆22Sep 20, 2022Updated 3 years ago
- Leaderboard of Frontier Models for Program Repair https://repairbench.github.io/☆11Oct 26, 2025Updated 4 months ago
- The official implementation of the paper "Large Scale Knowledge Washing"☆10Jun 12, 2024Updated last year
- VideoHallucer, The first comprehensive benchmark for hallucination detection in large video-language models (LVLMs)☆42Dec 16, 2025Updated 3 months ago
- [NeurIPS 2023 Datasets and Benchmarks Track] LAMM: Multi-Modal Large Language Models and Applications as AI Agents☆318Apr 16, 2024Updated last year
- 【ACL 2024】 SALAD benchmark & MD-Judge☆171Mar 8, 2025Updated last year