FoundationVision / WaverLinks
A video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.
☆617Updated last month
Alternatives and similar repositories for Waver
Users that are interested in Waver are comparing it to the libraries listed below
Sorting:
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆638Updated last week
- FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation☆446Updated 7 months ago
- An unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerfu…☆421Updated 2 months ago
- Pusa: Thousands Timesteps Video Diffusion Model☆649Updated last month
- [ICCV 2025] Video-T1: Test-Time Scaling for Video Generation☆294Updated 3 months ago
- Personalize Anything for Free with Diffusion Transformer☆349Updated 6 months ago
- [NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving☆248Updated 2 months ago
- UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation☆713Updated 2 months ago
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset☆265Updated 4 months ago
- [CVPR 2025] Official repo for ART:Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation☆344Updated 2 months ago
- Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).☆379Updated last month
- (CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models☆948Updated 2 months ago
- Let's finetune video generation models!☆507Updated 3 weeks ago
- [ICCV 2025] Light-A-Video: Training-free Video Relighting via Progressive Light Fusion☆467Updated 3 months ago
- [CVPR 2025] Official Implementation of MotionPro: A Precise Motion Controller for Image-to-Video Generation☆131Updated last month
- [SIGGRAPH 2025] Official code of the paper "FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios"☆334Updated last month
- ☆373Updated 2 months ago
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset☆229Updated last month
- [ICCV 2025] Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning☆194Updated 3 months ago
- Unofficial extension implementation of Self-Forcing to support I2V && 14B training.☆189Updated last week
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences☆317Updated last year
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models☆178Updated 2 months ago
- [ICCV2025] DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation☆191Updated 4 months ago
- VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning☆265Updated 5 months ago
- [CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Long…☆302Updated 6 months ago
- [ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen…☆257Updated 3 weeks ago
- The official implementation of ”RepVideo: Rethinking Cross-Layer Representation for Video Generation“☆119Updated 8 months ago
- [ICCV 2025] Official implementation for KV-Edit: Training-Free Image Editing for Precise Background Preservation☆341Updated 4 months ago
- Code repository for T2V-Turbo and T2V-Turbo-v2☆302Updated 8 months ago
- 4-steps distilled version of Wan2.2-TI2V-5B☆87Updated 3 weeks ago