lllyasviel / Omost
Your image is almost there!
☆7,207 Updated last month
Related projects:
- [SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation ☆5,127 Updated last week
- Create Magic Story! ☆5,787 Updated last month
- Enjoy the magic of Diffusion models! ☆6,349 Updated this week
- More relighting! ☆4,865 Updated 2 months ago
- Kolors Team ☆3,526 Updated 2 weeks ago
- Official inference repo for FLUX.1 models ☆13,678 Updated this week
- Understand Human Behavior to Align True Needs ☆3,268 Updated last month
- ☆4,269 Updated last month
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images. ☆2,182 Updated 2 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation ☆4,498 Updated 2 months ago
- Bring portraits to life! ☆11,729 Updated last week
- This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project. ☆11,247 Updated this week
- Official Code for Stable Cascade ☆6,511 Updated last month
- [WIP] Layer Diffusion for WebUI (via Forge) ☆3,788 Updated 2 weeks ago
- Official implementations for paper: Anydoor: zero-shot object-level image customization ☆3,926 Updated 5 months ago
- Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing> ☆4,216 Updated 2 months ago
- Official implementation of AnimateDiff. ☆10,270 Updated last month
- Open-Sora: Democratizing Efficient Video Production for All ☆21,609 Updated last month
- MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone ☆11,907 Updated this week
- Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on ☆5,392 Updated 4 months ago
- My ComfyUI workflows collection ☆4,747 Updated last month
- Brand new TTS solution ☆11,190 Updated this week
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean. ☆4,398 Updated last month
- FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generate using MULTI LLMs, without GPU neede… ☆8,451 Updated 5 months ago
- tiny vision language model ☆4,893 Updated 3 weeks ago
- Zero-Shot Speech Editing and Text-to-Speech in the Wild ☆7,459 Updated 2 months ago
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥 ☆10,850 Updated 2 months ago
- [CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model ☆10,359 Updated 2 months ago
- StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, … ☆4,490 Updated last month
- StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation ☆9,465 Updated last month