hanghuacs / MMComposition
☆10Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for MMComposition
- 🏆 See How Top MLLMs Understand Video Compositions.☆14Updated this week
- [ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper☆127Updated 6 months ago
- [CVPR 2024] Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners☆128Updated 4 months ago
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator☆91Updated 8 months ago
- A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems☆197Updated 2 months ago
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models☆143Updated last month
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆94Updated last month
- [ECCV 2024 Oral] Audio-Synchronized Visual Animation☆37Updated 2 months ago
- This is the official implementation for ControlVAR.☆55Updated last month
- Comparison between Frechet Video Distance implementation from StyleGAN-V and the original paper☆89Updated last year
- Official implementation for BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way☆18Updated last month
- ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)☆218Updated 4 months ago
- [CVPR2024] MotionEditor is the first diffusion-based model capable of video motion editing.☆138Updated 4 months ago
- ☆21Updated 6 months ago
- Implements VAR+CLIP for image generation☆78Updated 3 months ago
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆48Updated 2 months ago
- [CVPR 2024] Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection☆75Updated 4 months ago
- Accepted by CVPR 2024☆28Updated 6 months ago
- [CVPR 2024] | LAMP: Learn a Motion Pattern for Few-Shot Based Video Generation☆267Updated 6 months ago
- ☆65Updated 5 months ago
- Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos☆38Updated 3 months ago
- ☆26Updated 4 months ago
- 🔥ImageFolder: Autoregressive Image Generation with Folded Tokens☆55Updated last week
- Empowering Unified MLLM with Multi-granular Visual Generation☆106Updated last month
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]☆50Updated this week
- ☆137Updated 4 months ago
- The official implementation of the paper titled "StableV2V: Stablizing Shape Consistency in Video-to-Video Editing".☆37Updated this week
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆245Updated 3 weeks ago
- Training-Free Condition-Guided Text-to-Video Generation☆57Updated 10 months ago
- ☆193Updated 4 months ago