[ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer
★1,913 | Jul 3, 2025 | Updated 11 months ago
Alternatives and similar repositories for OminiControl
Users that are interested in OminiControl are comparing it to the libraries listed below.
- Official repository of In-Context LoRA for Diffusion Transformers | ★2,074 | Dec 20, 2024 | Updated last year
- Training-free Regional Prompting for Diffusion Transformers 🔥 | ★699 | Nov 28, 2024 | Updated last year
- [ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning | ★1,356 | Sep 12, 2025 | Updated 8 months ago
- Official implementation of OneDiffusion paper (CVPR 2025) | ★661 | Dec 14, 2024 | Updated last year
- [AAAI 2026] Personalize Anything for Free with Diffusion Transformer | ★361 | Mar 26, 2026 | Updated 2 months ago
- ★2,233 | Nov 8, 2024 | Updated last year
- Subjects200K dataset | ★131 | Jan 17, 2025 | Updated last year
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA | ★1,641 | Sep 25, 2024 | Updated last year
- ★571 | Nov 26, 2024 | Updated last year
- OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340 | ★4,326 | Dec 4, 2025 | Updated 6 months ago
- Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models" | ★985 | May 27, 2026 | Updated last week
- ★1,368 | Apr 21, 2025 | Updated last year
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt. | ★6,592 | Jun 28, 2024 | Updated last year
- [CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis | ★1,570 | Apr 16, 2026 | Updated last month
- Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer" (ICCV 2025) | ★1,725 | Jul 25, 2025 | Updated 10 months ago
- A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem… | ★2,217 | Apr 29, 2026 | Updated last month
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment | ★3,541 | Jul 31, 2025 | Updated 10 months ago
- Rectified Flow Inversion (RF-Inversion) - ICLR 2025 | ★475 | Mar 19, 2025 | Updated last year
- ★794 | Nov 22, 2024 | Updated last year
- [ICML 2025] "Taming Rectified Flow for Inversion and Editing" Using FLUX and HunyuanVideo for image and video editing! | ★630 | May 1, 2025 | Updated last year
- More relighting! | ★8,437 | Feb 20, 2025 | Updated last year
- [ICCV 2023] Consistent Image Synthesis and Editing | ★846 | Aug 19, 2024 | Updated last year
- [NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ … | ★2,095 | Dec 19, 2025 | Updated 5 months ago
- ★1,049 | May 14, 2025 | Updated last year
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis | ★3,301 | Oct 31, 2024 | Updated last year
- [CVPR 2025 Highlight 🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition | ★845 | Apr 14, 2026 | Updated last month
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥 | ★2,006 | Sep 18, 2024 | Updated last year
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer | ★8,104 | Jun 1, 2026 | Updated last week
- [ICCV'25] DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion | ★1,330 | Oct 17, 2025 | Updated 7 months ago
- Open-source unified multimodal model | ★5,980 | May 4, 2026 | Updated last month
- Scalable and memory-optimized training of diffusion models | ★1,359 | May 26, 2026 | Updated 2 weeks ago
- [ICCV 2025] Code Implementation of "ArtEditor: Learning Customized Instructional Image Editor from Few-Shot Examples" | ★430 | Apr 23, 2025 | Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generation | ★2,251 | Feb 16, 2025 | Updated last year
- [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion" | ★1,738 | Dec 17, 2024 | Updated last year
- [ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing | ★3,798 | Oct 17, 2025 | Updated 7 months ago
- [ICCV 2025] UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization | ★275 | May 1, 2025 | Updated last year
- Nodes for image juxtaposition for Flux in ComfyUI | ★1,398 | Jan 9, 2025 | Updated last year
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling | ★3,194 | Dec 21, 2024 | Updated last year
- [NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL | ★2,325 | May 7, 2026 | Updated last month