aiming-lab / MJ-Video
MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation
☆12Updated 2 months ago
Alternatives and similar repositories for MJ-Video:
Users that are interested in MJ-Video are comparing it to the libraries listed below
- [ICLR 2025] SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image and Video Generation☆36Updated 3 months ago
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆12Updated last month
- Official repository for paper "Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation"☆16Updated 2 weeks ago
- A Massive Multi-Discipline Lecture Understanding Benchmark☆16Updated this week
- (ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph.☆21Updated 3 months ago
- [ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google☆53Updated 8 months ago
- ☆11Updated 7 months ago
- [NeurIPS 2024] The official implement of research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Atten…☆42Updated 2 months ago
- Code and Data for "GenAI Arena: An Open Evaluation Platform for Generative Models" [NeurIPS 2024]☆19Updated 7 months ago
- Official Repository of Personalized Visual Instruct Tuning☆28Updated 2 months ago
- ☆35Updated 9 months ago
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆44Updated last year
- [NAACL 2024] LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-text Generation?☆38Updated 10 months ago
- Official implement of MIA-DPO☆56Updated 3 months ago
- 【NeurIPS 2024】The official code of paper "Automated Multi-level Preference for MLLMs"☆19Updated 7 months ago
- ☆40Updated 9 months ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆81Updated 3 weeks ago
- TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models☆31Updated 5 months ago
- Video Generation Benchmark☆16Updated 2 weeks ago
- VisRL: Intention-Driven Visual Perception via Reinforced Reasoning☆27Updated last month
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆38Updated 11 months ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models☆72Updated 10 months ago
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences☆38Updated last month
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"☆46Updated 5 months ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆47Updated 6 months ago
- [CVPR 2025] OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?☆55Updated last month
- LLMBind: A Unified Modality-Task Integration Framework☆18Updated 10 months ago
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Updated 10 months ago
- HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation☆57Updated 2 months ago
- ☆21Updated last year