Yushi-Hu / tifaView external linksLinks
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering
☆181Apr 29, 2024Updated last year
Alternatives and similar repositories for tifa
Users that are interested in tifa are comparing it to the libraries listed below
Sorting:
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)☆104Dec 9, 2024Updated last year
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"☆27Jul 10, 2023Updated 2 years ago
- Evaluating text-to-image/video/3D models with VQAScore☆374Sep 22, 2025Updated 4 months ago
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆645May 24, 2024Updated last year
- Repo for our NeurIPS 2023 paper on: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Fee…☆27Nov 11, 2023Updated 2 years ago
- ☆578Dec 21, 2024Updated last year
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆80Apr 10, 2024Updated last year
- [Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation☆330Dec 24, 2025Updated last month
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation☆1,635Oct 29, 2025Updated 3 months ago
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t…☆153Jun 25, 2024Updated last year
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆45Nov 29, 2023Updated 2 years ago
- Training code for CLIP-FlanT5☆30Jul 29, 2024Updated last year
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆310Nov 1, 2024Updated last year
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023☆294Jul 14, 2023Updated 2 years ago
- [NeurIPS 2023 Datasets and Benchmarks] "FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation", Yuanxin L…☆57Mar 4, 2024Updated last year
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆51Oct 14, 2024Updated last year
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆306Mar 12, 2025Updated 11 months ago
- ☆37Oct 7, 2023Updated 2 years ago
- ☆14Jul 5, 2024Updated last year
- Implementation of MDP: A Generalized Framework for Text-Guided Image Editing by Manipulating the Diffusion Path☆68Jun 23, 2023Updated 2 years ago
- LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation☆134Oct 25, 2023Updated 2 years ago
- Code for "DreamEdit: Subject-driven Image Editing" (TMLR2023)☆109Jan 23, 2024Updated 2 years ago
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation☆1,475Updated this week
- [NeurIPS 2023] A faithful benchmark for vision-language compositionality☆89Feb 13, 2024Updated 2 years ago
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models☆189Oct 3, 2024Updated last year
- VisualGPTScore for visio-linguistic reasoning