lllyasviel / Omost
Your image is almost there!
☆7,654 Jul 26, 2024 Updated last year
Alternatives and similar repositories for Omost
Users who are interested in Omost are comparing it to the libraries listed below.
- [SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation ☆5,942 Mar 19, 2025 Updated 10 months ago
- More relighting! ☆8,367 Feb 20, 2025 Updated 11 months ago
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper ☆6,383 Sep 26, 2024 Updated last year
- Enjoy the magic of Diffusion models! ☆11,773 Updated this week
- ComfyUI implementation of Omost ☆446 Feb 25, 2025 Updated 11 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation ☆2,251 Feb 16, 2025 Updated 11 months ago
- Kolors Team ☆4,599 Nov 13, 2024 Updated last year
- Open-Sora: Democratizing Efficient Video Production for All ☆28,510 Apr 30, 2025 Updated 9 months ago
- Agent Framework For Fintech and Banks ☆7,778 Updated this week
- PhotoMaker [CVPR 2024] ☆10,118 Oct 31, 2024 Updated last year
- Official implementation of AnimateDiff. ☆12,009 Jul 31, 2024 Updated last year
- [WIP] Layer Diffusion for WebUI (via Forge) ☆4,107 Aug 30, 2024 Updated last year
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images. ☆2,365 Jan 24, 2025 Updated last year
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt. ☆6,462 Jun 28, 2024 Updated last year
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding ☆4,295 Nov 27, 2025 Updated 2 months ago
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥 ☆11,909 Jul 18, 2024 Updated last year
- Official inference repo for FLUX.1 models ☆25,210 Jul 31, 2025 Updated 6 months ago
- Understand Human Behavior to Align True Needs ☆4,058 Aug 13, 2025 Updated 6 months ago
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment ☆1,276 Jul 17, 2024 Updated last year
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation ☆2,654 Mar 5, 2025 Updated 11 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG) ☆1,843 Feb 1, 2025 Updated last year
- Focus on prompting and generating ☆47,688 Dec 1, 2025 Updated 2 months ago
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment ☆3,521 Jul 31, 2025 Updated 6 months ago
- This project aims to reproduce Sora (OpenAI's T2V model); we hope the open source community will contribute to this project. ☆12,125 Oct 29, 2025 Updated 3 months ago
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone ☆23,756 Updated this week
- StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation ☆10,602 Dec 4, 2024 Updated last year
- Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing> ☆4,841 Mar 7, 2025 Updated 11 months ago
- Transparent Image Layer Diffusion using Latent Transparency ☆2,191 Jun 16, 2024 Updated last year
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023) ☆12,426 Nov 4, 2025 Updated 3 months ago
- ☆12,175 Jul 31, 2025 Updated 6 months ago
- Bring portraits to life! ☆17,799 Nov 16, 2025 Updated 2 months ago
- A generative speech model for daily dialogue. ☆38,696 Jan 18, 2026 Updated 3 weeks ago
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥 ☆2,006 Sep 18, 2024 Updated last year
- SOTA Open Source TTS ☆24,863 Feb 2, 2026 Updated last week
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. ☆103,139 Updated this week
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis ☆3,279 Oct 31, 2024 Updated last year
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation ☆5,020 Jul 2, 2024 Updated last year
- Let us control diffusion models! ☆33,640 Feb 25, 2024 Updated last year
- Industry leading face manipulation platform ☆26,787 Updated this week