official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]
☆113Dec 4, 2025Updated 2 months ago
Alternatives and similar repositories for VideoScore
Users that are interested in VideoScore are comparing it to the libraries listed below
Sorting:
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆85May 4, 2025Updated 9 months ago
- Official Implementation of VideoDPO☆160Jun 1, 2025Updated 8 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆307Mar 12, 2025Updated 11 months ago
- [AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation☆379Mar 26, 2025Updated 11 months ago
- [NeurIPS 2025] Improving Video Generation with Human Feedback☆428Sep 24, 2025Updated 5 months ago
- [ICCV 2025] Prompt-A-Video☆22Feb 2, 2025Updated last year
- [NeurIPS 2023 Datasets and Benchmarks] "FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation", Yuanxin L…☆57Mar 4, 2024Updated last year
- Code repository for T2V-Turbo and T2V-Turbo-v2☆311Jan 31, 2025Updated last year
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation☆1,485Updated this week
- Code for "Diffusion Model Alignment Using Direct Preference Optimization"☆661Nov 10, 2025Updated 3 months ago
- A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems☆414Sep 22, 2025Updated 5 months ago
- [CVPR 2025] T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆106Oct 25, 2025Updated 4 months ago
- ☆11Jan 18, 2025Updated last year
- A unified framework for controllable caption generation across images, videos, and audio. Supports multi-modal inputs and customizable ca…☆52Jul 24, 2025Updated 7 months ago
- [NeurIPS 2024] VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models☆176Sep 26, 2024Updated last year
- Official code for Paper "Mantis: Multi-Image Instruction Tuning" [TMLR 2024 Best Paper]☆239Jan 3, 2026Updated last month
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models (ICLR 2026)☆42Feb 18, 2026Updated last week
- Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Finding]"☆15Aug 27, 2025Updated 6 months ago
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍☆47Jul 5, 2025Updated 7 months ago
- DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support☆744Mar 22, 2024Updated last year
- [ICCV 2025] TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation☆38Nov 27, 2024Updated last year
- Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"☆504Sep 2, 2024Updated last year
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.☆1,875Jan 8, 2026Updated last month
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models☆190Oct 3, 2024Updated last year
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆180Jan 30, 2026Updated last month
- The official implementation of Diffusion-KTO: Aligning Diffusion Models by Optimizing Human Utility☆69Aug 16, 2025Updated 6 months ago
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models☆948Nov 13, 2024Updated last year
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection☆55Aug 16, 2025Updated 6 months ago
- [CVPRW2024, Official Code] for paper "Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribu…☆14Jun 14, 2024Updated last year
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆149Oct 25, 2024Updated last year
- ☆15Mar 30, 2025Updated 11 months ago
- ☆23Jul 20, 2025Updated 7 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆81Apr 10, 2024Updated last year
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆646May 24, 2024Updated last year
- VideoGen-Eval: Agent-based System for Video Generation Evaluation☆255Dec 16, 2025Updated 2 months ago
- The codebase for our EMNLP24 paper: Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Mo…☆86Jan 27, 2025Updated last year
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- [ACM Computing Surveys] The collection of awesome papers on alignment of diffusion models.☆403Feb 6, 2026Updated 3 weeks ago
- ☆42Oct 20, 2025Updated 4 months ago