modelscope / DiffSynth-Studio
Enjoy the magic of Diffusion models!
☆8,538Updated this week
Alternatives and similar repositories for DiffSynth-Studio:
Users that are interested in DiffSynth-Studio are comparing it to the libraries listed below
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆6,282Updated 7 months ago
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling☆2,919Updated 4 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆4,940Updated 10 months ago
- Kolors Team☆4,378Updated 5 months ago
- Bring portraits to life!☆14,765Updated 2 months ago
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,847Updated 8 months ago
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance☆2,339Updated 7 months ago
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆4,079Updated 3 months ago
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising☆2,690Updated 10 months ago
- Your image is almost there!☆7,589Updated 9 months ago
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥☆11,585Updated 9 months ago
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation☆2,522Updated 2 months ago
- Character Animation (AnimateAnyone, Face Reenactment)☆3,378Updated 11 months ago
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥☆1,907Updated 7 months ago
- SOTA Open Source TTS☆20,964Updated 3 weeks ago
- OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340☆4,009Updated 2 months ago
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.☆2,327Updated 3 months ago
- MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting☆4,059Updated 2 weeks ago
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆2,135Updated 2 months ago
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…☆9,031Updated 3 weeks ago
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment☆3,306Updated 5 months ago
- TripoSR: Fast 3D Object Reconstruction from a Single Image☆5,329Updated 8 months ago
- Wan: Open and Advanced Large-Scale Video Generative Models☆10,890Updated 2 weeks ago
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation☆8,410Updated 7 months ago
- ☆2,440Updated 11 months ago
- Open-Sora: Democratizing Efficient Video Production for All☆26,355Updated last week
- [ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion …☆1,570Updated 8 months ago
- [SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation☆5,794Updated last month
- Unofficial Implementation of Animate Anyone☆2,929Updated 9 months ago
- Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person☆5,857Updated 9 months ago