FoundationVision / WaverLinks
Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.
☆762Updated 3 months ago
Alternatives and similar repositories for Waver
Users that are interested in Waver are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025 Oral]Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation☆626Updated 2 weeks ago
- Pusa: Thousands Timesteps Video Diffusion Model☆666Updated 3 months ago
- An unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerfu…☆442Updated last week
- [AAAI-2026]FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation☆453Updated 9 months ago
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆661Updated last month
- LongLive: Real-time Interactive Long Video Generation☆890Updated last week
- [ICCV 2025] Video-T1: Test-Time Scaling for Video Generation☆301Updated 5 months ago
- Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).☆394Updated 3 months ago
- HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning☆994Updated last month
- [SIGGRAPH 2025] Official code of the paper "FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios"☆342Updated last month
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset☆269Updated 6 months ago
- ☆1,317Updated last month
- ☆383Updated 4 months ago
- Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives☆539Updated 2 weeks ago
- [Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset☆522Updated last month
- Native Multimodal Models are World Learners☆1,324Updated 2 weeks ago
- ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation☆617Updated 3 weeks ago
- UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation☆812Updated 2 weeks ago
- [ICCV 2025] Light-A-Video: Training-free Video Relighting via Progressive Light Fusion☆486Updated last month
- (CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models☆1,091Updated 4 months ago
- Let's finetune video generation models!☆525Updated 2 months ago
- [CVPR 2025] Official repo for ART:Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation☆353Updated 4 months ago
- [ICCV 2025] Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning☆208Updated last month
- MotionStream: Real-Time Video Generation with Interactive Motion Controls☆423Updated 3 weeks ago
- Community trainer for Lightricks' LTX Video model 🎬 ⚡️☆360Updated last month
- Official inference repo for FLUX.2 models☆1,170Updated last week
- The official code implementation of the paper "OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data."☆415Updated 6 months ago
- Unofficial extension implementation of Self-Forcing to support I2V && 14B training.☆274Updated 2 months ago
- [ArXiv 25] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling☆702Updated this week
- ☆282Updated 4 months ago