ziqihuangg/Awesome-Evaluation-of-Visual-Generation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ziqihuangg/Awesome-Evaluation-of-Visual-Generation)

ziqihuangg / Awesome-Evaluation-of-Visual-Generation

A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems

☆454

Alternatives and similar repositories for Awesome-Evaluation-of-Visual-Generation

Users that are interested in Awesome-Evaluation-of-Visual-Generation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Vchitect / VBench
View on GitHub
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
☆1,700Mar 23, 2026Updated 3 months ago
zzc-1998 / MLLM-QA-Papers-with-Code
View on GitHub
Collections of papers and code for employing MLLM for quality assessment tasks.
☆12Apr 18, 2024Updated 2 years ago
TIGER-AI-Lab / VideoScore
View on GitHub
official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]
☆121Dec 4, 2025Updated 7 months ago
Q-Future / Q-Refine
View on GitHub
[MM 2024 Oral] Refiner for AIGC
☆28Jul 29, 2024Updated last year
Coobiw / TriVQA
View on GitHub
[CVPRW2024, Official Code] for paper "Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribu…
☆13Jun 14, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
llyx97 / FETV
View on GitHub
[NeurIPS 2023 Datasets and Benchmarks] "FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation", Yuanxin L…
☆57Mar 4, 2024Updated 2 years ago
Q-Future / Q-Ground
View on GitHub
Official codes for "Q-Ground: Image Quality Grounding with Large Multi-modality Models", ACM MM2024 (Oral)
☆47Apr 21, 2026Updated 3 months ago
Q-Future / Q-Instruct
View on GitHub
②[CVPR 2024] Low-level visual instruction tuning, with a 200K dataset and a model zoo for fine-tuned checkpoints.
☆238Aug 12, 2024Updated last year
lcysyzxdxc / MPD
View on GitHub
[CVPR 2025 满分论文 Ratings: 555]
☆38May 9, 2025Updated last year
showlab / T2VScore
View on GitHub
T2VScore: Towards A Better Metric for Text-to-Video Generation
☆81Apr 10, 2024Updated 2 years ago
Q-Future / Q-Bench
View on GitHub
①[ICLR2024 Spotlight] (GPT-4V/Gemini-Pro/Qwen-VL-Plus+16 OS MLLMs) A benchmark for multi-modality LLMs (MLLMs) on low-level vision and vi…
☆287Aug 12, 2024Updated last year
Q-Future / Visual-Question-Answering-for-Video-Quality-Assessment
View on GitHub
[ACMMM2025] Official released code for VQA² series models
☆68Apr 21, 2026Updated 2 months ago
zai-org / VisionReward
View on GitHub
[AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
☆421Mar 26, 2025Updated last year
showlab / Awesome-Video-Diffusion
View on GitHub
A curated list of recent diffusion models for video generation, editing, and various other applications.
☆5,722Jun 16, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
3DTopia / GPTEval3D
View on GitHub
[ CVPR 2024 ] Implementation for "GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation"
☆288Jun 12, 2024Updated 2 years ago
yzhang2016 / video-generation-survey
View on GitHub
A reading list of video generation
☆721Jul 10, 2026Updated last week
cocoshe / I2EBench
View on GitHub
[NeurIPS'24] I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing
☆35Dec 9, 2025Updated 7 months ago
linzhiqiu / t2v_metrics
View on GitHub
Evaluating text-to-image/video/3D models with VQAScore
☆595Jun 5, 2026Updated last month
evalcrafter / EvalCrafter
View on GitHub
[CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
☆193Oct 3, 2024Updated last year
yipoh / AesNet
View on GitHub
[TPAMI] Multi-modality Multi-attribute Contrastive Pre-training for Image Aesthetics Computing
☆24Jul 3, 2025Updated last year
taco-group / COVER
View on GitHub
🏆 [CVPRW 2024] COVER: A Comprehensive Video Quality Evaluator. 🥇 Winner solution for Video Quality Assessment Challenge at the 1st AIS…
☆99Jul 18, 2024Updated 2 years ago
ChenHsing / Awesome-Video-Diffusion-Models
View on GitHub
[CSUR] A Survey on Video Diffusion Models
☆2,303Jun 22, 2026Updated 3 weeks ago
woshidandan / Champion-Solution-for-CVPR-NTIRE-2024-Quality-Assessment-on-AIGC
View on GitHub
🥇[1st Official Code] Quality Assessment for AI-Generated Content - Track 1: Image AIGC内容质量评估冠军方案
☆61Jul 29, 2025Updated 11 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
zijianchen98 / AGIN
View on GitHub
[IEEE TCSVT'24] Study of Subjective and Objective Naturalness Assessment of AI-Generated Images
☆38Apr 29, 2026Updated 2 months ago
snap-research / Panda-70M
View on GitHub
[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
☆700Oct 25, 2024Updated last year
zzc-1998 / SJTU-H3D
View on GitHub
[TIP 2025] Advancing Zero-Shot Digital Human Quality Assessment through Text-Prompted Evaluation
☆12Jul 8, 2023Updated 3 years ago
wenhao728 / awesome-diffusion-v2v
View on GitHub
Awesome diffusion Video-to-Video (V2V). A collection of paper on diffusion model-based video editing, aka. video-to-video (V2V) translati…
☆291Apr 8, 2026Updated 3 months ago
Kai-Liu001 / Dog-IQA
View on GitHub
PyTorch code for our paper "Dog-IQA: Standard-guided Zero-shot MLLM for Mix-grain Image Quality Assessment"
☆28Oct 7, 2024Updated last year
CIntellifusion / VideoDPO
View on GitHub
Official Implementation of VideoDPO
☆169Jun 1, 2025Updated last year
Vchitect / Evaluation-Agent
View on GitHub
[ACL2025 Oral & Award] Evaluate Image/Video Generation like Humans - Fast, Explainable, Flexible
☆128Aug 10, 2025Updated 11 months ago
MizzenAI / HPSv3
View on GitHub
Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025)
☆325Dec 5, 2025Updated 7 months ago
zai-org / ImageReward
View on GitHub
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
☆1,694Oct 29, 2025Updated 8 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
QMME / T2VQA
View on GitHub
☆27Nov 27, 2024Updated last year
NJU-PCALab / OpenVid-1M
View on GitHub
[ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation
☆452May 30, 2025Updated last year
j-min / DSG
View on GitHub
Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)
☆109Dec 9, 2024Updated last year
zeyofu / Commonsense-T2I
View on GitHub
Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]
☆24Aug 13, 2024Updated last year
synvo-ai / HippoCamp
View on GitHub
A benchmark for evaluating contextual agents on realistic multimodal personal-computer environments with profiling and factual-retention …
☆29Apr 2, 2026Updated 3 months ago
yuvalkirstain / PickScore
View on GitHub
☆600Dec 21, 2024Updated last year
Q-Future / Q-Align
View on GitHub
③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.
☆608Jun 24, 2026Updated 3 weeks ago