jialuli-luka / Video-MSGLinks
Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization
☆24Updated 9 months ago
Alternatives and similar repositories for Video-MSG
Users that are interested in Video-MSG are comparing it to the libraries listed below
Sorting:
- [Neurips 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller☆50Updated 6 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)☆86Updated 11 months ago
- [ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation☆115Updated 3 months ago
- [ICCV2025] VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation☆33Updated 5 months ago
- Benchmark dataset and code of MSRVTT-Personalization☆52Updated 2 months ago
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text☆53Updated 10 months ago
- VideoAuteur: Towards Long Narrative Video Generation☆43Updated 3 months ago
- ☆47Updated 9 months ago
- ☆52Updated last year
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers☆77Updated 6 months ago
- Official codes for the paper "GARDO: Reinforcing Diffusion Models without Reward Hacking"☆51Updated this week
- ☆35Updated last month
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis☆62Updated 9 months ago
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)☆53Updated 8 months ago
- Video Diffusion Transformers are In-Context Learners☆36Updated last year
- official code repo of CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation☆60Updated 6 months ago
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆85Updated 9 months ago
- [AAAI 2026] GenMAC for Compositional Text-to-Video Generation☆32Updated 3 weeks ago
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"☆76Updated last year
- Official PyTorch implementation of paper “InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction”☆33Updated 6 months ago
- [Neurips 2025 NextVid Workshop Oral✨] Official Implementation of VideoGen-of-Thought: Step-by-step generating multi-shot video with minim…☆57Updated 4 months ago
- E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models☆33Updated last month
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"☆52Updated last year
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆157Updated 3 weeks ago
- ☆22Updated last year
- Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning☆57Updated 3 months ago
- [ICLR 2025] Trajectory Attention For Fine-grained Video Motion Control☆96Updated 8 months ago
- ☆100Updated 2 weeks ago
- ☆55Updated 9 months ago
- [arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generation☆24Updated last year