A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems
☆419Sep 22, 2025Updated 5 months ago
Alternatives and similar repositories for Awesome-Evaluation-of-Visual-Generation
Users that are interested in Awesome-Evaluation-of-Visual-Generation are comparing it to the libraries listed below
Sorting:
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation☆1,532Mar 13, 2026Updated last week
- Collections of papers and code for employing MLLM for quality assessment tasks.☆13Apr 18, 2024Updated last year
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]☆113Dec 4, 2025Updated 3 months ago
- [MM 2024 Oral] Refiner for AIGC☆29Jul 29, 2024Updated last year
- [CVPRW2024, Official Code] for paper "Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribu…☆14Jun 14, 2024Updated last year
- [NeurIPS 2023 Datasets and Benchmarks] "FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation", Yuanxin L…☆57Mar 4, 2024Updated 2 years ago
- [CVPR 2025 满分论文 Ratings: 555]☆37May 9, 2025Updated 10 months ago
- Evaluating text-to-image/video/3D models with VQAScore☆381Sep 22, 2025Updated 5 months ago
- Official codes for "Q-Ground: Image Quality Grounding with Large Multi-modality Models", ACM MM2024 (Oral)☆44Oct 25, 2024Updated last year
- ②[CVPR 2024] Low-level visual instruction tuning, with a 200K dataset and a model zoo for fine-tuned checkpoints.☆235Aug 12, 2024Updated last year
- ①[ICLR2024 Spotlight] (GPT-4V/Gemini-Pro/Qwen-VL-Plus+16 OS MLLMs) A benchmark for multi-modality LLMs (MLLMs) on low-level vision and vi…☆282Aug 12, 2024Updated last year
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆81Apr 10, 2024Updated last year
- [ACMMM2025] Official released code for VQA² series models☆61Oct 19, 2025Updated 5 months ago
- [NeurIPS'24] I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing☆33Dec 9, 2025Updated 3 months ago
- A curated list of recent diffusion models for video generation, editing, and various other applications.☆5,524Updated this week
- [ CVPR 2024 ] Implementation for "GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation"☆284Jun 12, 2024Updated last year
- A reading list of video generation☆678Updated this week
- [TPAMI] Multi-modality Multi-attribute Contrastive Pre-training for Image Aesthetics Computing☆25Jul 3, 2025Updated 8 months ago
- 🏆 [CVPRW 2024] COVER: A Comprehensive Video Quality Evaluator. 🥇 Winner solution for Video Quality Assessment Challenge at the 1st AIS…☆97Jul 18, 2024Updated last year
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models☆191Oct 3, 2024Updated last year
- [CSUR] A Survey on Video Diffusion Models☆2,282Updated this week
- 🥇[1st Official Code] Quality Assessment for AI-Generated Content - Track 1: Image AIGC内容质量评估冠军方案☆60Jul 29, 2025Updated 7 months ago
- [IEEE TCSVT'24] Study of Subjective and Objective Naturalness Assessment of AI-Generated Images☆37Feb 9, 2026Updated last month
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers☆676Oct 25, 2024Updated last year
- [AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation☆389Mar 26, 2025Updated 11 months ago
- Awesome diffusion Video-to-Video (V2V). A collection of paper on diffusion model-based video editing, aka. video-to-video (V2V) translati…☆280Nov 24, 2025Updated 3 months ago
- [TIP 2025] Advancing Zero-Shot Digital Human Quality Assessment through Text-Prompted Evaluation☆10Jul 8, 2023Updated 2 years ago
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆24Aug 13, 2024Updated last year
- Official Implementation of VideoDPO☆161Jun 1, 2025Updated 9 months ago
- PyTorch code for our paper "Dog-IQA: Standard-guided Zero-shot MLLM for Mix-grain Image Quality Assessment"☆28Oct 7, 2024Updated last year
- [ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation☆408May 30, 2025Updated 9 months ago
- ③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.☆580Mar 12, 2025Updated last year
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)☆105Dec 9, 2024Updated last year
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆187Jan 30, 2026Updated last month
- Let's finetune video generation models!☆546Sep 15, 2025Updated 6 months ago
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation☆1,650Oct 29, 2025Updated 4 months ago
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆111Sep 19, 2025Updated 6 months ago
- A comprehensive collection of IQA papers☆1,470Mar 11, 2026Updated last week
- [ICME2024, Official Code] for paper "Bringing Textual Prompt to AI-Generated Image Quality Assessment"☆21Jul 9, 2024Updated last year