ziqihuangg / Awesome-Evaluation-of-Visual-Generation
A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems
☆158Updated last month
Related projects: ⓘ
- [Neurips 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation☆190Updated last month
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models☆118Updated 2 weeks ago
- Official code of SmartEdit [CVPR-2024 Highlight]☆227Updated 3 months ago
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t…☆96Updated 2 months ago
- 🚀 Cross attention map tools for huggingface/diffusers☆115Updated 2 months ago
- ☆168Updated 2 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation"☆37Updated this week
- The benchmark of SOTA text-to-image diffusion models with a new benchmarking strategy based on MiniGPT-4, namely X-IQE.☆105Updated last year
- [CVPR 2024] | LAMP: Learn a Motion Pattern for Few-Shot Based Video Generation☆252Updated 4 months ago
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".☆91Updated 2 months ago
- 🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).☆293Updated 3 weeks ago
- ☆113Updated 2 months ago
- [ICLR2024] Official repo for paper "PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code"☆232Updated 6 months ago
- [ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper☆117Updated 4 months ago
- Unified Multi-modal IAA Baseline and Benchmark☆68Updated 5 months ago
- [CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models☆196Updated last week
- A one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024)☆144Updated 2 weeks ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition☆89Updated 3 weeks ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆75Updated 2 months ago
- A reading list of video generation☆362Updated this week
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation☆490Updated 2 weeks ago
- [CVPR2024] MotionEditor is the first diffusion-based model capable of video motion editing.☆129Updated 2 months ago
- ☆89Updated 9 months ago
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision☆107Updated 2 months ago
- [ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions☆112Updated 2 months ago
- ☆89Updated 4 months ago
- 🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)☆146Updated 5 months ago
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT☆129Updated 4 months ago
- Benchmark for generative image models☆53Updated last year
- You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.☆199Updated 3 months ago