FoundationVision / WaverLinks
Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.
☆863Updated 4 months ago
Alternatives and similar repositories for Waver
Users that are interested in Waver are comparing it to the libraries listed below
Sorting:
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆670Updated 3 months ago
- Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives☆591Updated last month
- An unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerfu…☆447Updated last month
- Pusa: Thousands Timesteps Video Diffusion Model☆671Updated 4 months ago
- [Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset☆556Updated 2 months ago
- Native Multimodal Models are World Learners☆1,406Updated 3 weeks ago
- [AAAI-2026]FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation☆455Updated 10 months ago
- HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning☆1,080Updated 3 weeks ago
- [NeurIPS 2025 Oral]Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation☆689Updated last month
- LongLive: Real-time Interactive Long Video Generation☆974Updated last week
- [CVPR 2025] Official repo for ART:Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation☆363Updated 5 months ago
- The official code of Yume☆578Updated last week
- MotionStream: Real-Time Video Generation with Interactive Motion Controls☆480Updated 2 months ago
- Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).☆410Updated 4 months ago
- [NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance☆538Updated 2 weeks ago
- [ICCV 2025] Video-T1: Test-Time Scaling for Video Generation☆304Updated 6 months ago
- ☆304Updated last week
- ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation☆656Updated 2 months ago
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset☆269Updated 7 months ago
- [SIGGRAPH 2025] Official code of the paper "FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios"☆345Updated 2 months ago
- ☆572Updated last week
- Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition☆679Updated last month
- We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while a…☆417Updated last week
- (CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models☆1,172Updated 5 months ago
- Krea Realtime 14B. An open-source realtime AI video model.☆459Updated 2 months ago
- [ICCV 2025] Light-A-Video: Training-free Video Relighting via Progressive Light Fusion☆495Updated 2 months ago
- ☆1,921Updated last month
- The official code implementation of the paper "OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data."☆425Updated 7 months ago
- [SIGGRAPH2025] Official repo for paper "Any-length Video Inpainting and Editing with Plug-and-Play Context Control"☆550Updated 9 months ago
- [ICCV 2025] Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning☆210Updated 2 months ago