[ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥
☆619Dec 12, 2025Updated 4 months ago
Alternatives and similar repositories for RAG-Diffusion
Users that are interested in RAG-Diffusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Training-free Regional Prompting for Diffusion Transformers 🔥☆696Nov 28, 2024Updated last year
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍☆46Jul 5, 2025Updated 9 months ago
- Official repository of In-Context LoRA for Diffusion Transformers☆2,071Dec 20, 2024Updated last year
- CoDi:Subject-Consistent and Pose-Diverse Text-to-Image Generation☆37Aug 1, 2025Updated 9 months ago
- [ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer☆1,911Jul 3, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes☆96Nov 26, 2025Updated 5 months ago
- Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"☆975Apr 24, 2026Updated last week
- [TPAMI 2026] ConsistentID : Portrait Generation with Multimodal Fine-Grained Identity Preserving☆1,019Jan 2, 2026Updated 3 months ago
- [🚀ICML 2025] "Taming Rectified Flow for Inversion and Editing" Using FLUX and HunyuanVideo for image and video editing!☆627May 1, 2025Updated last year
- It is an Android-based application that enables managing hotspot properties through a web interface, providing mobile routing functionali…☆156Dec 19, 2024Updated last year
- User Identity Scaffolding for Multiple OIDC Authentications for User☆95Jun 14, 2025Updated 10 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,844Feb 1, 2025Updated last year
- [CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition☆839Apr 14, 2026Updated 2 weeks ago
- [CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer☆1,382Mar 13, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment☆1,284Jul 17, 2024Updated last year
- [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"☆1,729Dec 17, 2024Updated last year
- ☆246Nov 24, 2024Updated last year
- Code for SCIS-2025 Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".☆1,189Apr 15, 2025Updated last year
- [ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning☆1,353Sep 12, 2025Updated 7 months ago
- [AAAI 2026] Personalize Anything for Free with Diffusion Transformer☆363Mar 26, 2026Updated last month
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te…☆1,131Feb 7, 2025Updated last year
- [ICML 2023 Oral, NeurIPS 2023] Official implementations for paper: Customizable Image Synthesis with Multiple Subjects☆447Sep 12, 2023Updated 2 years ago
- kight is a static analysis tool for c/c++ programs.☆213Dec 27, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Evaluation of Text-to-Video Generation Models: A Dynamics Perspective[NeurIPS 2024].☆274Dec 3, 2024Updated last year
- [ICLR 2025] Codebase for "CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation"☆265Mar 6, 2026Updated last month
- Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation☆155Oct 18, 2024Updated last year
- Unofficial Implementation of ReplaceAnything: https://aigcdesigngroup.github.io/replace-anything/☆399May 27, 2024Updated last year
- Advanced Unsupervised Image Enhancement with GAN☆247Nov 11, 2024Updated last year
- [AAAI 2025] Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking☆118May 18, 2025Updated 11 months ago
- An open-source library with a powerful Contrastive Language-and-Motion (CLaM) pre-training evaluator☆97Nov 23, 2025Updated 5 months ago
- ☆251Feb 11, 2025Updated last year
- Efficient DiT architecture for text2any tasks, ICLR2025☆446May 10, 2025Updated 11 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICLR 2025] Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation☆3,695Feb 27, 2025Updated last year
- A curated list of papers, code and resources pertaining to image composition/compositing or object insertion/addition/compositing, which …☆534Feb 24, 2026Updated 2 months ago
- Welcome to the 'Open-Alteryx-Macro' project. This project is aimed at providing an open-source solution for managing and updating Alteryx…☆156May 25, 2024Updated last year
- Code for paper "Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models"☆241May 24, 2024Updated last year
- ☆1,055May 14, 2025Updated 11 months ago
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation☆8,648Sep 14, 2024Updated last year
- ☆135May 6, 2024Updated last year