Your image is almost there!
☆7,639Jul 26, 2024Updated last year
Alternatives and similar repositories for Omost
Users that are interested in Omost are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation☆5,947Mar 19, 2025Updated last year
- More relighting!☆8,409Feb 20, 2025Updated last year
- ComfyUI implementation of Omost☆446Feb 25, 2025Updated last year
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆6,403Sep 26, 2024Updated last year
- Agent Framework For Fintech and Banks☆7,823Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Enjoy the magic of Diffusion models!☆12,212Apr 8, 2026Updated last week
- Kolors Team☆4,607Nov 13, 2024Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,253Feb 16, 2025Updated last year
- Open-Sora: Democratizing Efficient Video Production for All☆28,861Apr 9, 2026Updated last week
- [WIP] Layer Diffusion for WebUI (via Forge)☆4,104Aug 30, 2024Updated last year
- Official implementation of AnimateDiff.☆12,096Jul 31, 2024Updated last year
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆4,294Nov 27, 2025Updated 4 months ago
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment☆1,281Jul 17, 2024Updated last year
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆6,525Jun 28, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.☆2,366Jan 24, 2025Updated last year
- PhotoMaker [CVPR 2024]☆10,111Oct 31, 2024Updated last year
- Understand Human Behavior to Align True Needs☆4,058Aug 13, 2025Updated 8 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,843Feb 1, 2025Updated last year
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥☆11,945Jul 18, 2024Updated last year
- Official inference repo for FLUX.1 models☆25,403Jul 31, 2025Updated 8 months ago
- Transparent Image Layer Diffusion using Latent Transparency☆2,198Jun 16, 2024Updated last year
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation☆2,664Mar 5, 2025Updated last year
- ☆12,428Jul 31, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment☆3,532Jul 31, 2025Updated 8 months ago
- Focus on prompting and generating☆48,056Dec 1, 2025Updated 4 months ago
- Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>☆4,849Mar 7, 2025Updated last year
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone☆24,365Apr 1, 2026Updated 2 weeks ago
- A generative speech model for daily dialogue.☆39,077Updated this week
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥☆2,005Sep 18, 2024Updated last year
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆12,639Nov 4, 2025Updated 5 months ago
- Bring portraits to life!☆18,117Mar 2, 2026Updated last month
- This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.☆12,153Mar 8, 2026Updated last month
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation☆10,684Dec 4, 2024Updated last year
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.☆108,818Updated this week
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation☆1,914Oct 31, 2024Updated last year
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,289Oct 31, 2024Updated last year
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆3,003Sep 8, 2024Updated last year
- Let us control diffusion models!☆33,805Feb 25, 2024Updated 2 years ago
- [ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!☆840Jan 7, 2026Updated 3 months ago