lllyasviel / Omost
Your image is almost there!
☆7,468Updated 5 months ago
Alternatives and similar repositories for Omost:
Users that are interested in Omost are comparing it to the libraries listed below
- [SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation☆5,505Updated 4 months ago
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆6,116Updated 3 months ago
- More relighting!☆7,348Updated last month
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆4,775Updated 6 months ago
- Kolors Team☆4,108Updated 2 months ago
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.☆2,293Updated last month
- Enjoy the magic of Diffusion models!☆6,742Updated this week
- Understand Human Behavior to Align True Needs☆3,643Updated 5 months ago
- SOTA Open Source TTS☆18,396Updated this week
- Bring portraits to life!☆13,655Updated 2 weeks ago
- Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on☆5,974Updated 8 months ago
- Zero-Shot Speech Editing and Text-to-Speech in the Wild☆8,011Updated 6 months ago
- Various AI scripts. Mostly Stable Diffusion stuff.☆3,817Updated 2 weeks ago
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment☆2,982Updated last month
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆8,947Updated this week
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.☆5,400Updated 3 weeks ago
- OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340☆3,386Updated last month
- [WIP] Layer Diffusion for WebUI (via Forge)☆3,935Updated 4 months ago
- Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>☆4,479Updated 6 months ago
- Inference and training library for high-quality TTS models.☆4,910Updated last month
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation☆2,389Updated 5 months ago
- 我的 ComfyUI 工作流合集 | My ComfyUI workflows collection☆5,628Updated 3 weeks ago
- Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative☆4,025Updated 3 months ago
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…☆8,063Updated this week
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance☆2,092Updated 3 months ago
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising☆2,559Updated 6 months ago
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation☆8,119Updated 4 months ago
- Official inference repo for FLUX.1 models☆19,466Updated last week
- tiny vision language model☆6,732Updated this week
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆9,662Updated this week