Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models (ACL-Findings 2024)
☆16Apr 23, 2024Updated last year
Alternatives and similar repositories for VALOR
Users that are interested in VALOR are comparing it to the libraries listed below
Sorting:
- Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries☆34Nov 19, 2025Updated 3 months ago
- Official implementation of "CONCRETE: Improving Cross-lingual Fact Checking with Cross-lingual Retrieval" (COLING'22)☆16Oct 13, 2022Updated 3 years ago
- Codebase for VidHal: Benchmarking Hallucinations in Vision LLMs☆14Apr 19, 2025Updated 10 months ago
- VHTest☆15Oct 31, 2024Updated last year
- Official implementation of the ACL 2023 paper: "Faking Fake News for Real Fake News Detection: Propaganda-Loaded Training Data Generation…☆39Aug 7, 2023Updated 2 years ago
- ☆19Mar 12, 2025Updated 11 months ago
- [ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"☆21Mar 26, 2025Updated 11 months ago
- An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation☆155Jan 15, 2024Updated 2 years ago
- ☆30Feb 14, 2025Updated last year
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆69May 31, 2024Updated last year
- Code and data for the ACL 2024 Findings paper "Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning"☆27Jun 5, 2024Updated last year
- Official repository for the MMFM challenge☆25Jun 18, 2024Updated last year
- ☆27Feb 15, 2025Updated last year
- M-HalDetect Dataset Release☆27Nov 4, 2023Updated 2 years ago
- ACL'24 (Oral) Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback☆77Sep 12, 2024Updated last year
- Official Code and Data repository of our ACL 2021 paper X-FACT: A New Benchmark Dataset for Multilingual Fact Checking.☆27Oct 4, 2024Updated last year
- Aligning LMMs with Factually Augmented RLHF☆392Nov 1, 2023Updated 2 years ago
- pytorch☆10Apr 13, 2022Updated 3 years ago
- ☆15Feb 12, 2026Updated 3 weeks ago
- ☆11May 24, 2024Updated last year
- ☆10Oct 11, 2022Updated 3 years ago
- [EMNLP2023]: MIRACLE: Towards Personalized Dialogue Generation with Latent-Space Multiple Personal Attribute Control☆12Nov 11, 2023Updated 2 years ago
- ☆11May 16, 2025Updated 9 months ago
- ☆12Dec 12, 2024Updated last year
- Detecting and Evaluating Medical Hallucinations in Large Vision Language Models☆11Jun 24, 2024Updated last year
- Python package to download and use the SSB datasets☆11Aug 3, 2023Updated 2 years ago
- Multi-Aspect Controllable Text Generation with Disentangled Counterfactual Augmentation, ACL 2024 (main)☆13Sep 23, 2024Updated last year
- LLMEval☆11Feb 12, 2024Updated 2 years ago
- ☆15Jan 25, 2025Updated last year
- ☆17Nov 7, 2023Updated 2 years ago
- A project designed to build and render a full Minecraft crafting tree.☆10Aug 10, 2021Updated 4 years ago
- Multi-task UNet for medical image classification and saliency prediction☆15Jan 29, 2022Updated 4 years ago
- The official repository for the CodeGym project: "Generalizable End-to-End Tool-Use RL with Synthetic CodeGym"☆23Oct 14, 2025Updated 4 months ago
- ☆11Jun 5, 2023Updated 2 years ago
- Preprint | Previously at GenBio ICML 2025☆18Aug 20, 2025Updated 6 months ago
- Application of OpenAI tools such as Whisper, DALL-E, and ChatGPT to generate album covers from audio☆12May 31, 2023Updated 2 years ago
- Lipschitz Lifelong RL☆11Nov 6, 2020Updated 5 years ago
- The official GitHub page for the survey paper "Content Generation Models in Computational Pathology: A Comprehensive Survey on Methods, A…☆21Oct 9, 2025Updated 4 months ago
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann…☆14Nov 3, 2023Updated 2 years ago