modelscope / DiffSynth-Studio
Enjoy the magic of Diffusion models!
☆6,349Updated this week
Related projects: ⓘ
- Create Magic Story!☆5,787Updated last month
- Your image is almost there!☆7,207Updated last month
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆4,498Updated 2 months ago
- More relighting!☆4,865Updated 2 months ago
- [SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation☆5,127Updated last week
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.☆2,182Updated 2 months ago
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.☆4,398Updated last month
- Kolors Team☆3,526Updated 2 weeks ago
- Brand new TTS solution☆11,190Updated this week
- Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on☆5,392Updated 4 months ago
- ☆4,269Updated last month
- Bring portraits to life!☆11,729Updated last week
- Real time interactive streaming digital human☆3,462Updated last week
- Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>☆4,216Updated 2 months ago
- [CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model☆10,359Updated 2 months ago
- [ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild☆3,610Updated last month
- Understand Human Behavior to Align True Needs☆3,268Updated last month
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆6,376Updated last month
- Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image☆2,850Updated last month
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation☆2,108Updated last month
- This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.☆11,247Updated this week
- Inference and training library for high-quality TTS models.☆4,193Updated last month
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising☆2,318Updated 2 months ago
- 我的 ComfyUI 工作流合集 | My ComfyUI workflows collection☆4,747Updated last month
- Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person☆5,526Updated last month
- [WIP] Layer Diffusion for WebUI (via Forge)☆3,788Updated 2 weeks ago
- MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone☆11,907Updated this week
- StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation☆9,465Updated last month
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆3,926Updated 5 months ago
- Official inference repo for FLUX.1 models☆13,678Updated this week