modelscope / DiffSynth-StudioLinks
Enjoy the magic of Diffusion models!
☆9,895Updated last week
Alternatives and similar repositories for DiffSynth-Studio
Users that are interested in DiffSynth-Studio are comparing it to the libraries listed below
Sorting:
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆6,339Updated 11 months ago
- More relighting!☆8,212Updated 6 months ago
- Kolors Team☆4,527Updated 9 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆5,001Updated last year
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆4,229Updated 7 months ago
- Understand Human Behavior to Align True Needs☆3,988Updated 3 weeks ago
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆11,909Updated this week
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation☆2,596Updated 6 months ago
- Your image is almost there!☆7,663Updated last year
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance☆2,447Updated last month
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.☆2,355Updated 7 months ago
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,933Updated last year
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆2,196Updated 6 months ago
- [CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text☆1,595Updated 5 months ago
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling☆3,053Updated 8 months ago
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising☆2,775Updated last year
- [NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image☆3,469Updated last month
- Character Animation (AnimateAnyone, Face Reenactment)☆3,434Updated last year
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment☆3,464Updated last month
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆4,472Updated this week
- HunyuanVideo: A Systematic Framework For Large Video Generation Model☆11,015Updated 2 weeks ago
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation☆8,574Updated 11 months ago
- [AAAI 2025] Official implementation of "OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on"☆6,405Updated last year
- The ultimate training toolkit for finetuning diffusion models☆6,227Updated last week
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥☆11,806Updated last year
- [SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation☆5,900Updated 5 months ago
- Open-Sora: Democratizing Efficient Video Production for All☆27,135Updated 4 months ago
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆6,212Updated last year
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆4,185Updated last year
- OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340☆4,247Updated 2 months ago