FoundationVision / WaverLinks
Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.
☆785Updated 4 months ago
Alternatives and similar repositories for Waver
Users that are interested in Waver are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025 Oral]Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation☆670Updated last month
- Pusa: Thousands Timesteps Video Diffusion Model☆669Updated 3 months ago
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆665Updated 2 months ago
- An unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerfu…☆445Updated 3 weeks ago
- [Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset☆541Updated 2 months ago
- [ICCV 2025] Video-T1: Test-Time Scaling for Video Generation☆304Updated 6 months ago
- [AAAI-2026]FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation☆455Updated 9 months ago
- [SIGGRAPH 2025] Official code of the paper "FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios"☆345Updated 2 months ago
- HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning☆1,050Updated last week
- LongLive: Real-time Interactive Long Video Generation☆932Updated 3 weeks ago
- Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives☆570Updated last month
- Native Multimodal Models are World Learners☆1,374Updated last month
- (CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models☆1,128Updated 4 months ago
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset☆270Updated 6 months ago
- ☆382Updated 5 months ago
- Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).☆403Updated 4 months ago
- [CVPR 2025] Official repo for ART:Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation☆359Updated 4 months ago
- Taming large-scale few-step training with self-adversarial flows! 👏🏻☆349Updated this week
- [ICCV 2025] Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning☆210Updated last month
- MotionStream: Real-Time Video Generation with Interactive Motion Controls☆455Updated last month
- rCM: SOTA Diffusion Distillation & Few-Step Video Generation based on sCM/MeanFlow☆430Updated this week
- [NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance☆464Updated last week
- ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation☆639Updated last month
- [ICCV 2025] Light-A-Video: Training-free Video Relighting via Progressive Light Fusion☆491Updated 2 months ago
- Unofficial extension implementation of Self-Forcing to support I2V && 14B training.☆304Updated 3 months ago
- ☆316Updated 3 months ago
- UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation☆824Updated last week
- ☆260Updated last week
- The official code implementation of the paper "OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data."☆419Updated 6 months ago
- ☆367Updated 9 months ago