[ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"
☆21 · Mar 26, 2025 · Updated 11 months ago
Alternatives and similar repositories for BEAF
Users who are interested in BEAF are comparing it to the repositories listed below
- [NAACL'24] Repository for "SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models" ☆15 · Jun 18, 2024 · Updated last year
- [AAAI'24] Official PyTorch implementation of the paper "FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radianc… ☆18 · Nov 29, 2024 · Updated last year
- [RA-L'24, IROS'24] Official PyTorch Implementation of "Uni-DVPS: Unified Model for Depth-Aware Video Panoptic Segmentation" ☆13 · Oct 11, 2024 · Updated last year
- ☆40 · Apr 14, 2025 · Updated 10 months ago
- [TMLR'24, ICLR'25] Official repository for "NeuFace: A Large-Scale 3D Face Mesh Video Dataset via Neural Re-parameterized Optimization" ☆32 · Jul 7, 2025 · Updated 7 months ago
- ☆16 · Oct 21, 2024 · Updated last year
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs. ☆19 · Jun 27, 2024 · Updated last year
- [ICML 2024] SimPro: A Simple Probabilistic Framework Towards Realistic Long-Tailed Semi-Supervised Learning ☆32 · Sep 30, 2024 · Updated last year
- [INTERSPEECH'24] Official repository for "Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert" ☆19 · Jun 25, 2025 · Updated 8 months ago
- [ICCV’23] Official repository for "TextManiA: Enriching Visual Feature by Text-driven Manifold Augmentation" ☆22 · Nov 1, 2023 · Updated 2 years ago
- [ICML 2025] Official PyTorch implementation of "NegMerge: Sign-Consensual Weight Merging for Machine Unlearning" ☆14 · Nov 25, 2025 · Updated 3 months ago
- ☆10 · Jul 5, 2024 · Updated last year
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data ☆14 · Sep 30, 2023 · Updated 2 years ago
- [ICCV'25] Official PyTorch Implementation of "JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers" ☆29 · Nov 27, 2025 · Updated 3 months ago
- ☆11 · Oct 2, 2024 · Updated last year
- Official implementation of ICML 2025 paper "Understanding Multimodal LLMs Under Distribution Shifts: An Information-Theoretic Approach" ☆11 · May 27, 2025 · Updated 9 months ago
- VisualGPTScore for visio-linguistic reasoning ☆27 · Oct 7, 2023 · Updated 2 years ago
- Training code for CLIP-FlanT5 ☆30 · Jul 29, 2024 · Updated last year
- Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?" ☆15 · Jan 25, 2024 · Updated 2 years ago
- [ECCV'24] Official PyTorch Implementation of "Learning-based Axial Video Motion Magnification" ☆24 · Dec 22, 2024 · Updated last year
- [BMVC'25] Official repository for "Learning Correlation-aware Aleatoric Uncertainty for 3D Hand Pose Estimation" ☆23 · Dec 8, 2025 · Updated 2 months ago
- [TACL] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study ☆16 · Nov 22, 2024 · Updated last year
- Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?" (NeurIPS 2024) ☆96 · Oct 19, 2024 · Updated last year
- Research Papers on Efficient Neural Fields from EffL Group ☆16 · Apr 21, 2025 · Updated 10 months ago
- ☆17 · Aug 1, 2024 · Updated last year
- ☆18 · Jul 24, 2023 · Updated 2 years ago
- Official repo for Discriminator Guidance for ImageNet256. ☆13 · Apr 27, 2023 · Updated 2 years ago
- [BMVC 2024 Oral ✨] Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization ☆20 · Sep 11, 2024 · Updated last year
- Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models (ACL-Findings 2024) ☆16 · Apr 23, 2024 · Updated last year
- InstructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc… ☆32 · Jun 13, 2024 · Updated last year
- Codebase for "VLMaterial: Procedural Material Generation with Large Vision-Language Models" ☆46 · Feb 18, 2025 · Updated last year
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023) ☆45 · Nov 29, 2023 · Updated 2 years ago
- [AAAI'25] Zero-shot Depth Completion via Test-time Alignment with Affine-invariant Depth Prior ☆42 · Feb 11, 2025 · Updated last year
- ☆47 · Nov 8, 2024 · Updated last year
- Implementation and dataset for paper "Can MLLMs Perform Text-to-Image In-Context Learning?" ☆47 · Jun 2, 2025 · Updated 9 months ago
- SIM4D ☆30 · Mar 27, 2025 · Updated 11 months ago
- This repository is related to 'Intriguing Properties of Hyperbolic Embeddings in Vision-Language Models', published at TMLR (2024), https… ☆22 · Jul 5, 2024 · Updated last year
- On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning, … ☆19 · Dec 16, 2024 · Updated last year
- [CVPR 2024] The official implementation of paper "synthesize, diagnose, and optimize: towards fine-grained vision-language understanding" ☆52 · Jun 16, 2025 · Updated 8 months ago