Saiyan-World / goku
[CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/
☆2,842 · Updated 3 months ago
Alternatives and similar repositories for goku
Users who are interested in goku are comparing it to the repositories listed below.
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo ☆1,460 · Updated last week
- Official repository for LTX-Video ☆6,281 · Updated last week
- ☆2,980 · Updated 2 months ago
- Official implementations for paper: VACE: All-in-One Video Creation and Editing ☆2,273 · Updated 2 weeks ago
- SkyReels V1: The first and most advanced open-source human-centric video foundation model ☆2,175 · Updated 2 months ago
- SkyReels-V2: Infinite-length Film Generative model ☆2,645 · Updated last week
- The best OSS video generation models ☆3,183 · Updated 4 months ago
- FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis ☆1,288 · Updated 2 weeks ago
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation ☆962 · Updated 2 weeks ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer ☆4,197 · Updated 2 weeks ago
- ☆2,144 · Updated last month
- [ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video ☆1,231 · Updated last week
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment ☆1,056 · Updated this week
- LTX-Video Support for ComfyUI ☆1,992 · Updated 2 weeks ago
- Stable Virtual Camera: Generative View Synthesis with Diffusion Models ☆1,285 · Updated this week
- Official PyTorch implementation of One-Minute Video Generation with Test-Time Training ☆1,576 · Updated last month
- MAGI-1: Autoregressive Video Generation at Scale ☆3,191 · Updated this week
- Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation" ☆2,779 · Updated 3 weeks ago
- LatentSync: Taming Audio-Conditioned Latent Diffusion Models for Lip Sync with SyncNet Supervision ☆4,111 · Updated 2 weeks ago
- 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning ☆1,075 · Updated last month
- A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem… ☆1,326 · Updated this week
- Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persisten… ☆1,565 · Updated 2 weeks ago
- [CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation ☆3,805 · Updated 3 months ago
- A minimal and universal controller for FLUX.1. ☆1,601 · Updated 2 weeks ago
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion ☆2,157 · Updated 2 months ago
- ☆745 · Updated 3 months ago
- Open-source unified multimodal model ☆3,499 · Updated this week
- HunyuanVideo: A Systematic Framework For Large Video Generation Model ☆10,179 · Updated last week
- Wan: Open and Advanced Large-Scale Video Generative Models ☆11,851 · Updated this week
- OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340 ☆4,089 · Updated 3 months ago