instantX-research / Regional-Prompting-FLUXView external linksLinks
Training-free Regional Prompting for Diffusion Transformers π₯
β692Nov 28, 2024Updated last year
Alternatives and similar repositories for Regional-Prompting-FLUX
Users that are interested in Regional-Prompting-FLUX are comparing it to the libraries listed below
Sorting:
- [ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformerβ1,903Jul 3, 2025Updated 7 months ago
- [ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement π₯β620Dec 12, 2025Updated 2 months ago
- Official repository of In-Context LoRA for Diffusion Transformersβ2,058Dec 20, 2024Updated last year
- Nodes for image juxtaposition for Flux in ComfyUIβ1,394Jan 9, 2025Updated last year
- β572Nov 26, 2024Updated last year
- Rectified Flow Inversion (RF-Inversion) - ICLR 2025β470Mar 19, 2025Updated 10 months ago
- β426Nov 4, 2024Updated last year
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignmentβ3,521Jul 31, 2025Updated 6 months ago
- Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"β936Dec 23, 2025Updated last month
- [AAAI 2026] Personalize Anything for Free with Diffusion Transformerβ354Mar 20, 2025Updated 10 months ago
- β790Nov 22, 2024Updated last year
- [πICML 2025] "Taming Rectified Flow for Inversion and Editing" Using FLUX and HunyuanVideo for image and video editing!β613May 1, 2025Updated 9 months ago
- [ICCV 2025] π₯π₯ UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioningβ1,350Sep 12, 2025Updated 5 months ago
- β479Oct 30, 2024Updated last year
- OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340β4,314Dec 4, 2025Updated 2 months ago
- β1,354Apr 21, 2025Updated 9 months ago
- β2,229Nov 8, 2024Updated last year
- β388Jul 13, 2025Updated 7 months ago
- [ICCV 2025] Official implementation for KV-Edit: Training-Free Image Editing for Precise Background Preservationβ368May 21, 2025Updated 8 months ago
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation π₯β2,005Sep 18, 2024Updated last year
- Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Controlβ191Dec 31, 2024Updated last year
- β1,048May 14, 2025Updated 9 months ago
- Official implementation of OneDiffusion paper (CVPR 2025)β665Dec 14, 2024Updated last year
- [NeurIPS 2024] Boosting the performance of consistency models with PCM!β512Dec 11, 2024Updated last year
- Concept Sliders for Precise Control of Diffusion Modelsβ1,127Jun 20, 2025Updated 7 months ago
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRAβ1,632Sep 25, 2024Updated last year
- StoryMaker: Towards consistent characters in text-to-image generationβ720Dec 2, 2024Updated last year
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignmentβ1,276Jul 17, 2024Updated last year
- [ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidanceβ307Jul 30, 2025Updated 6 months ago
- β235May 9, 2025Updated 9 months ago
- A set of nodes to edit videos using the Hunyuan Video modelβ494Feb 21, 2025Updated 11 months ago
- The ultimate training toolkit for finetuning diffusion modelsβ9,449Feb 7, 2026Updated last week
- Official repo for CFG-Zero*β704May 2, 2025Updated 9 months ago
- [ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Modelsβ3,685Updated this week
- CSGO: Content-Style Composition in Text-to-Image Generation π₯β384Sep 5, 2024Updated last year
- [SIGGRAPH Asia 2025] DreamO: A Unified Framework for Image Customizationβ1,745Aug 14, 2025Updated 6 months ago
- [ICCV2025] UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalizationβ276May 1, 2025Updated 9 months ago
- Implement Region Attention for Flux modelβ140Mar 2, 2025Updated 11 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)β1,843Feb 1, 2025Updated last year