Codebase for VidHal: Benchmarking Hallucinations in Vision LLMs
☆14Apr 19, 2025Updated 10 months ago
Alternatives and similar repositories for VidHal
Users who are interested in VidHal are comparing it to the repositories listed below
- (ACM MM24) This is the official repository of GIST: Improving Parameter Efficient Fine Tuning via Knowledge Interaction.☆11Jan 28, 2024Updated 2 years ago
- THEORY OF SPACE: a benchmark for evaluating whether foundation models can actively explore under partial observability efficiently to bui…☆36Updated this week
- [ICML 2025] Official implementation of Spherical Diffusion Policy: A SE(3) Equivariant Visuomotor Policy with Spherical Fourier Represent…☆39Jul 8, 2025Updated 7 months ago
- PELA: Learning Parameter-Efficient Models with Low-Rank Approximation [CVPR 2024]☆19Apr 14, 2024Updated last year
- Code for CVPR22 paper: Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection.☆49Jun 5, 2025Updated 8 months ago
- ☆23Aug 2, 2024Updated last year
- ☆30Feb 14, 2025Updated last year
- Code associated with the "Natural Language Rationales with Full-Stack Visual Reasoning" EMNLP Findings 2020 paper☆24Jan 15, 2021Updated 5 years ago
- Awesome Vision-Language Compositionality, a comprehensive curation of research papers in literature.☆34Feb 13, 2025Updated last year
- [CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)☆61Aug 17, 2021Updated 4 years ago
- VisualGPTScore for visio-linguistic reasoning☆27Oct 7, 2023Updated 2 years ago
- Memory, Attention and Composition (MAC) Network for CLEVR/GQA implemented in PyTorch☆27Aug 26, 2024Updated last year
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022☆31May 29, 2023Updated 2 years ago
- ☆29Oct 20, 2025Updated 4 months ago
- EventHallusion: Diagnosing Event Hallucinations in Video LLMs☆34Aug 5, 2025Updated 6 months ago
- ☆17Sep 10, 2025Updated 5 months ago
- ☆40Mar 11, 2020Updated 5 years ago
- DOMIAS, a density-based MIA model that aims to infer membership by targeting local overfitting of the generative model.☆12May 29, 2023Updated 2 years ago
- [EMNLP2023]: MIRACLE: Towards Personalized Dialogue Generation with Latent-Space Multiple Personal Attribute Control☆12Nov 11, 2023Updated 2 years ago
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i…☆23Nov 29, 2025Updated 3 months ago
- Partially Non-Autoregressive Image Captioning☆10Sep 30, 2021Updated 4 years ago
- This repository contains the code for the Transformer-Representation Neural Topic Model (TNTM) based on the paper "Probabilistic Topic Mo…☆12Jul 6, 2024Updated last year
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated 2 years ago
- Epub Highlighter highlights specified words in EPub w/o meaning.☆11Jul 26, 2017Updated 8 years ago
- Different Bangla datasets for sentiment analysis on Bangla text☆10Nov 26, 2022Updated 3 years ago
- Generalization in Metric Learning: Should the Embedding Layer be the Embedding Layer?☆11Jan 3, 2019Updated 7 years ago
- A Simple Framework for CV Pre-training Model (SOCO, VirTex, BEiT)☆15Oct 18, 2021Updated 4 years ago
- Stream Overlay for Twitch Clip Highlights☆11May 21, 2021Updated 4 years ago
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022☆13Apr 13, 2022Updated 3 years ago
- ☆10Sep 22, 2024Updated last year
- NAACL'2021: Non-Parametric Few-Shot Learning for Word Sense Disambiguation☆10Jul 1, 2021Updated 4 years ago
- Preprint | Previously at GenBio ICML 2025☆18Aug 20, 2025Updated 6 months ago
- Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021)☆12Oct 25, 2021Updated 4 years ago
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann…☆14Nov 3, 2023Updated 2 years ago
- Play Chrome's Dinosaur Game with Reinforcement Learning☆11Jun 25, 2023Updated 2 years ago
- IRFL: Image Recognition of Figurative Language☆11Nov 30, 2023Updated 2 years ago
- A Java implementation of Avro Phonetic☆13Oct 7, 2017Updated 8 years ago
- Distributed k-nearest Neighbors using Locality Sensitive Hashing and SYCL☆10Jun 7, 2021Updated 4 years ago
- [ECCV 2024] Characterizing Robustness via Natural Input Gradients☆13Oct 14, 2024Updated last year