Saiyan-World / goku
Video Generation Foundation Models: https://saiyan-world.github.io/goku/
☆2,755Updated last month
Alternatives and similar repositories for goku:
Users that are interested in goku are comparing it to the libraries listed below
- SkyReels V1: The first and most advanced open-source human-centric video foundation model☆1,915Updated 3 weeks ago
- ☆2,734Updated 2 weeks ago
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo☆1,213Updated this week
- Taming Stable Diffusion for Lip Sync!☆3,317Updated last week
- The best OSS video generation models☆3,056Updated 2 months ago
- [CVPR 2025] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis☆1,254Updated 2 weeks ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆3,796Updated this week
- OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340☆3,868Updated last month
- [CVPR 2025] Learning Flow Fields in Attention for Controllable Person Image Generation☆1,434Updated last month
- Motion-Controllable Video Diffusion via Warped Noise☆820Updated last week
- ☆887Updated 2 weeks ago
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling☆2,862Updated 3 months ago
- [ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video☆755Updated this week
- ☆4,087Updated 2 weeks ago
- [CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation☆3,382Updated last month
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment☆516Updated last week
- Stable Virtual Camera: Generative View Synthesis with Diffusion Models☆1,015Updated this week
- Wan: Open and Advanced Large-Scale Video Generative Models☆9,327Updated this week
- Implementation of [CVPR 2025] "DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation"☆741Updated last month
- Official repository for LTX-Video☆3,221Updated 3 weeks ago
- Code of LHM: Large Animatable Human Reconstruction Model for Single Image to 3D in Seconds☆1,259Updated this week
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆2,098Updated 3 weeks ago
- ☆716Updated last month
- CVPR2025☆812Updated last week
- Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"☆1,471Updated 2 months ago
- [CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation☆892Updated this week
- Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"☆2,225Updated 2 weeks ago
- CogView4, CogView3-Plus and CogView3(ECCV 2024)☆951Updated last week
- [CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System☆3,210Updated last month
- YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open☆4,636Updated this week