[ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer
β1,903Jul 3, 2025Updated 7 months ago
Alternatives and similar repositories for OminiControl
Users that are interested in OminiControl are comparing it to the libraries listed below
Sorting:
- Official repository of In-Context LoRA for Diffusion Transformersβ2,058Dec 20, 2024Updated last year
- Training-free Regional Prompting for Diffusion Transformers π₯β694Nov 28, 2024Updated last year
- [ICCV 2025] π₯π₯ UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioningβ1,350Sep 12, 2025Updated 5 months ago
- Official implementation of OneDiffusion paper (CVPR 2025)β664Dec 14, 2024Updated last year
- β2,230Nov 8, 2024Updated last year
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRAβ1,633Sep 25, 2024Updated last year
- [AAAI 2026] Personalize Anything for Free with Diffusion Transformerβ355Mar 20, 2025Updated 11 months ago
- OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340β4,313Dec 4, 2025Updated 2 months ago
- Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"β938Dec 23, 2025Updated 2 months ago
- β572Nov 26, 2024Updated last year
- β1,357Apr 21, 2025Updated 10 months ago
- Subjects200K datasetβ130Jan 17, 2025Updated last year
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.β6,471Jun 28, 2024Updated last year
- A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemβ¦β2,139Dec 29, 2025Updated 2 months ago
- [CVPR 2025 Oral]Infinity β : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesisβ1,546Nov 10, 2025Updated 3 months ago
- Rectified Flow Inversion (RF-Inversion) - ICLR 2025β469Mar 19, 2025Updated 11 months ago
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignmentβ3,523Jul 31, 2025Updated 7 months ago
- Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"(ICCV2025)β1,717Jul 25, 2025Updated 7 months ago
- PixArt-Ξ±: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesisβ3,281Oct 31, 2024Updated last year
- More relighting!β8,375Feb 20, 2025Updated last year
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformerβ4,969Updated this week
- [πICML 2025] "Taming Rectified Flow for Inversion and Editing" Using FLUX and HunyuanVideo for image and video editing!β615May 1, 2025Updated 9 months ago
- [CVPR 2025 Highlightπ₯] Identity-Preserving Text-to-Video Generation by Frequency Decompositionβ828Aug 30, 2025Updated 6 months ago
- β790Nov 22, 2024Updated last year
- [NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ β¦β2,078Dec 19, 2025Updated 2 months ago
- β1,049May 14, 2025Updated 9 months ago
- Scalable and memory-optimized training of diffusion modelsβ1,338Jun 4, 2025Updated 8 months ago
- [ICCV'25]DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusionβ1,327Oct 17, 2025Updated 4 months ago
- [ICCV 2023] Consistent Image Synthesis and Editingβ837Aug 19, 2024Updated last year
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation π₯β2,007Sep 18, 2024Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generationβ2,251Feb 16, 2025Updated last year
- Open-source unified multimodal modelβ5,686Oct 27, 2025Updated 4 months ago
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modelingβ3,161Dec 21, 2024Updated last year
- [ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editingβ3,656Oct 17, 2025Updated 4 months ago
- [ICCV 2025] Code Implementation of "ArtEditor: Learning Customized Instructional Image Editor from Few-Shot Examples"β433Apr 23, 2025Updated 10 months ago
- [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"β1,709Dec 17, 2024Updated last year
- Nodes for image juxtaposition for Flux in ComfyUIβ1,395Jan 9, 2025Updated last year
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignmentβ1,488Sep 11, 2025Updated 5 months ago
- πΉ A more flexible framework that can generate videos at any resolution and creates videos from images.β1,912Updated this week