ziqihuangg / Awesome-Evaluation-of-Visual-Generation
A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems
☆197Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for Awesome-Evaluation-of-Visual-Generation
- [Neurips 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation☆211Updated 2 weeks ago
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models☆141Updated last month
- Official code of SmartEdit [CVPR-2024 Highlight]☆254Updated 5 months ago
- ☆193Updated 4 months ago
- Empowering Unified MLLM with Multi-granular Visual Generation☆106Updated last month
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]☆50Updated this week
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t…☆105Updated 4 months ago
- 🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).☆361Updated last week
- Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"☆371Updated 2 months ago
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers☆526Updated 3 weeks ago
- 📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.☆215Updated 2 weeks ago
- The paper collections for the autoregressive models in vision.☆229Updated this week
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation☆580Updated 2 weeks ago
- ☆127Updated 2 weeks ago
- GenEval: An object-focused framework for evaluating text-to-image alignment☆120Updated 3 months ago
- PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.☆393Updated 6 months ago
- [CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models☆207Updated last month
- [ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions☆157Updated 4 months ago
- A collection of awesome video generation studies.☆346Updated this week
- [ICLR2024] Official repo for paper "PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code"☆261Updated 8 months ago
- 🚀 Cross attention map tools for huggingface/diffusers☆153Updated last week
- ☆93Updated 6 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆84Updated 4 months ago
- The benchmark of SOTA text-to-image diffusion models with a new benchmarking strategy based on MiniGPT-4, namely X-IQE.☆110Updated last year
- A one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024)☆150Updated last month
- [CVPR 2024] | LAMP: Learn a Motion Pattern for Few-Shot Based Video Generation☆267Updated 6 months ago
- [NeurIPS 2024] VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models☆115Updated last month
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision☆111Updated this week
- ☆93Updated 4 months ago
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".☆100Updated 4 months ago