FoundationVision / WaverLinks
Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.
☆650Updated 2 months ago
Alternatives and similar repositories for Waver
Users that are interested in Waver are comparing it to the libraries listed below
Sorting:
- An unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerfu…☆428Updated 2 months ago
- FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation☆447Updated 7 months ago
- Pusa: Thousands Timesteps Video Diffusion Model☆659Updated last month
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆652Updated 2 weeks ago
- [ICCV 2025] Video-T1: Test-Time Scaling for Video Generation☆296Updated 4 months ago
- ☆264Updated this week
- UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation☆721Updated last week
- Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).☆383Updated 2 months ago
- Personalize Anything for Free with Diffusion Transformer☆350Updated 7 months ago
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset☆267Updated 4 months ago
- LongLive: Real-time Interactive Long Video Generation☆757Updated 2 weeks ago
- (CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models☆994Updated 2 months ago
- [SIGGRAPH 2025] Official code of the paper "FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios"☆338Updated 2 months ago
- [ICCV 2025] Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning☆200Updated 4 months ago
- [ICCV 2025] Official implementation for KV-Edit: Training-Free Image Editing for Precise Background Preservation☆346Updated 5 months ago
- [Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset☆410Updated this week
- VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning☆267Updated 6 months ago
- [NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving☆252Updated 2 months ago
- 4-steps distilled version of Wan2.2-TI2V-5B☆102Updated last month
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset☆231Updated 2 months ago
- Let's finetune video generation models!☆514Updated last month
- [ICCV 2025] Light-A-Video: Training-free Video Relighting via Progressive Light Fusion☆478Updated this week
- [CVPR 2025] Official Implementation of MotionPro: A Precise Motion Controller for Image-to-Video Generation☆135Updated 2 months ago
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization☆581Updated last month
- ☆153Updated this week
- ☆379Updated 3 months ago
- [ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen…☆258Updated last month
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences☆318Updated last year
- Official implementation of OneDiffusion paper (CVPR 2025)☆651Updated 10 months ago
- [CVPR 2025] Official repo for ART:Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation☆347Updated 2 months ago