official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]
☆113Dec 4, 2025Updated 3 months ago
Alternatives and similar repositories for VideoScore
Users that are interested in VideoScore are comparing it to the libraries listed below
Sorting:
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆86May 4, 2025Updated 10 months ago
- Official Implementation of VideoDPO☆161Jun 1, 2025Updated 9 months ago
- Code repository for T2V-Turbo and T2V-Turbo-v2☆314Jan 31, 2025Updated last year
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆312Mar 12, 2025Updated last year
- [ICCV 2025] Prompt-A-Video☆22Feb 2, 2025Updated last year
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models (ICLR 2026)☆45Mar 3, 2026Updated 2 weeks ago
- Code for "Diffusion Model Alignment Using Direct Preference Optimization"☆670Nov 10, 2025Updated 4 months ago
- [NeurIPS 2025] Improving Video Generation with Human Feedback☆437Sep 24, 2025Updated 5 months ago
- [AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation☆389Mar 26, 2025Updated 11 months ago
- [NeurIPS 2023 Datasets and Benchmarks] "FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation", Yuanxin L…☆57Mar 4, 2024Updated 2 years ago
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation☆1,532Mar 13, 2026Updated last week
- A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems☆419Sep 22, 2025Updated 5 months ago
- Official code for Paper "Mantis: Multi-Image Instruction Tuning" [TMLR 2024 Best Paper]☆239Jan 3, 2026Updated 2 months ago
- [NeurIPS 2024] VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models☆184Sep 26, 2024Updated last year
- [CVPR 2025] T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆107Oct 25, 2025Updated 4 months ago
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models☆191Oct 3, 2024Updated last year
- [ICCV 2025] TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation☆38Nov 27, 2024Updated last year
- DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support☆750Mar 22, 2024Updated last year
- ☆15Mar 30, 2025Updated 11 months ago
- ☆11Jan 18, 2025Updated last year
- Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Finding]"☆15Aug 27, 2025Updated 6 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆81Apr 10, 2024Updated last year
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆314Nov 1, 2024Updated last year
- A unified framework for controllable caption generation across images, videos, and audio. Supports multi-modal inputs and customizable ca…☆52Jul 24, 2025Updated 7 months ago
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.☆1,895Jan 8, 2026Updated 2 months ago
- Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"☆509Sep 2, 2024Updated last year
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆649May 24, 2024Updated last year
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning☆53Jul 23, 2025Updated 7 months ago
- The official implementation of Diffusion-KTO: Aligning Diffusion Models by Optimizing Human Utility☆70Aug 16, 2025Updated 7 months ago
- Evaluation codes and data for GenEval2☆60Jan 8, 2026Updated 2 months ago
- [CVPRW2024, Official Code] for paper "Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribu…☆14Jun 14, 2024Updated last year
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models☆950Nov 13, 2024Updated last year
- [ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation☆408May 30, 2025Updated 9 months ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆187Jan 30, 2026Updated last month
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection☆56Aug 16, 2025Updated 7 months ago
- Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024…☆67Nov 19, 2024Updated last year
- [ACM Computing Surveys] The collection of awesome papers on alignment of diffusion models.☆408Feb 6, 2026Updated last month
- ☆26Jun 22, 2024Updated last year
- [NeurIPS 2024 D&B Spotlight🔥] ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation☆210Mar 8, 2026Updated last week