meituan-longcat / LongCat-VideoLinks
☆1,921Updated last month
Alternatives and similar repositories for LongCat-Video
Users that are interested in LongCat-Video are comparing it to the libraries listed below
Sorting:
- HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning☆1,080Updated 3 weeks ago
- Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"☆1,407Updated 3 weeks ago
- Light Image Video Generation Inference Framework☆1,822Updated this week
- HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation☆2,654Updated 2 months ago
- Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.☆863Updated 4 months ago
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆670Updated 3 months ago
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation☆1,200Updated 3 months ago
- SkyReels-A2: Compose anything in video diffusion transformers☆697Updated 7 months ago
- [ArXiv 25] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling☆1,354Updated 2 weeks ago
- Official inference repo for FLUX.2 models☆1,540Updated this week
- Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.☆2,900Updated this week
- ☆1,999Updated last month
- [ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning☆1,345Updated 4 months ago
- Pusa: Thousands Timesteps Video Diffusion Model☆671Updated 4 months ago
- Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.☆713Updated last month
- ☆1,046Updated 8 months ago
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment☆1,474Updated 4 months ago
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers☆580Updated 7 months ago
- 🔥🔥 Open-sourced unified customization model☆1,199Updated 4 months ago
- PersonaLive! : Expressive Portrait Image Animation for Live Streaming☆1,388Updated 3 weeks ago
- ☆368Updated 10 months ago
- Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives☆591Updated last month
- (CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models☆1,172Updated 5 months ago
- Official code for StoryMem: Multi-shot Long Video Storytelling with Memory☆620Updated 3 weeks ago
- Native Multimodal Models are World Learners☆1,406Updated 3 weeks ago
- MoCha: End-to-End Video Character Replacement without Structural Guidance☆597Updated last week
- Qwen-Image-Lightning: Speed up Qwen-Image model with distillation☆1,181Updated 3 weeks ago
- [NeurIPS 2025 Oral]Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation☆689Updated last month
- The official repo for "Vidi: Large Multimodal Models for Video Understanding and Editing"☆561Updated last week
- ☆572Updated last week