modelscope / DiffSynth-StudioLinks
Enjoy the magic of Diffusion models!
☆11,696Updated this week
Alternatives and similar repositories for DiffSynth-Studio
Users that are interested in DiffSynth-Studio are comparing it to the libraries listed below
Sorting:
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆6,381Updated last year
- Kolors Team☆4,596Updated last year
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆12,403Updated 3 months ago
- More relighting!☆8,352Updated 11 months ago
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆4,293Updated 2 months ago
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance☆2,524Updated 2 months ago
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,987Updated last year
- The best OSS video generation models, created by Genmo☆3,588Updated 2 months ago
- Character Animation (AnimateAnyone, Face Reenactment)☆3,487Updated last year
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆2,248Updated 11 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆5,019Updated last year
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling☆3,154Updated last year
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.☆2,364Updated last year
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation☆2,650Updated 11 months ago
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising☆2,819Updated last year
- MAGI-1: Autoregressive Video Generation at Scale☆3,638Updated 7 months ago
- ☆3,167Updated 10 months ago
- Wan: Open and Advanced Large-Scale Video Generative Models☆15,244Updated last month
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆4,950Updated last week
- [CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text☆1,625Updated 10 months ago
- Bring portraits to life!☆17,743Updated 2 months ago
- Taming Stable Diffusion for Lip Sync!☆5,400Updated 7 months ago
- [AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning☆4,183Updated 6 months ago
- SkyReels V1: The first and most advanced open-source human-centric video foundation model☆2,643Updated 10 months ago
- [NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image☆3,536Updated 6 months ago
- Open-Sora: Democratizing Efficient Video Production for All☆28,492Updated 9 months ago
- Your image is almost there!☆7,656Updated last year
- [CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"☆10,902Updated 5 months ago
- Understand Human Behavior to Align True Needs☆4,058Updated 5 months ago
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment☆3,517Updated 6 months ago