Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)
☆105Dec 9, 2024Updated last year
Alternatives and similar repositories for DSG
Users that are interested in DSG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering☆182Apr 29, 2024Updated last year
- Evaluating text-to-image/video/3D models with VQAScore☆381Sep 22, 2025Updated 6 months ago
- [NeurIPS 2023] A faithful benchmark for vision-language compositionality☆89Feb 13, 2024Updated 2 years ago
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t…☆154Jun 25, 2024Updated last year
- Collections of papers and code for employing MLLM for quality assessment tasks.☆13Apr 18, 2024Updated last year
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆651May 24, 2024Updated last year
- GenEval: An object-focused framework for evaluating text-to-image alignment☆435Mar 3, 2025Updated last year
- Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024…☆67Nov 19, 2024Updated last year
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆45Nov 29, 2023Updated 2 years ago
- ☆26Feb 3, 2023Updated 3 years ago
- DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)☆143Jun 10, 2025Updated 9 months ago
- Official repo for "GMS-3DQA: Projection-based Grid Mini-patch Sampling for 3D Model Quality Assessment"☆13Mar 10, 2024Updated 2 years ago
- Data repository for the VALSE benchmark.☆37Feb 15, 2024Updated 2 years ago
- [Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation☆334Dec 24, 2025Updated 3 months ago
- ☆16Oct 21, 2024Updated last year
- [WIP@Oct 13] 质衡-基准测试 (Q-Bench in Chinese),包含中文版【底层视觉问答】和【底层视觉描述】数据集,以及中文提示下的图片质量评价。 We will release Q-Bench in more languages in the futu…☆24Jan 7, 2024Updated 2 years ago
- ☆582Dec 21, 2024Updated last year
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation☆33Mar 30, 2025Updated 11 months ago
- Example scripts for using [my] fine-tuned CLIP models with HuggingFace 🤗☆13Sep 24, 2024Updated last year
- LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation☆134Oct 25, 2023Updated 2 years ago
- Official Code for Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation (CVPR 2025)☆13Apr 2, 2025Updated 11 months ago
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation☆1,651Oct 29, 2025Updated 4 months ago
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆314Nov 1, 2024Updated last year
- ☆360Jan 27, 2024Updated 2 years ago
- ☆12Mar 5, 2025Updated last year
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023☆294Jul 14, 2023Updated 2 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- ☆13Jan 22, 2025Updated last year
- [ICCV'23 Oral] The introduction and toolkit for EqBen Benchmark☆123Dec 11, 2023Updated 2 years ago
- ☆58Apr 24, 2024Updated last year
- FaithScore: Fine-grained Evaluations of Hallucinations in Large Vision-Language Models☆33Nov 27, 2025Updated 3 months ago
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching☆169Nov 18, 2024Updated last year
- Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆57Jul 25, 2023Updated 2 years ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆22Nov 8, 2023Updated 2 years ago
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022☆31May 29, 2023Updated 2 years ago
- Code for the EMNLP 2021 Oral paper "Are Gender-Neutral Queries Really Gender-Neutral? Mitigating Gender Bias in Image Search" https://arx…☆12Feb 6, 2023Updated 3 years ago
- Holistic evaluation of multimodal foundation models☆49Aug 11, 2024Updated last year
- [MM 2024 Oral] Refiner for AIGC☆29Jul 29, 2024Updated last year