NJU-PCALab/RAG-Diffusion

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/NJU-PCALab/RAG-Diffusion)

NJU-PCALab / RAG-Diffusion

[ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥

☆622

Alternatives and similar repositories for RAG-Diffusion

Users that are interested in RAG-Diffusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

instantX-research / Regional-Prompting-FLUX
View on GitHub
Training-free Regional Prompting for Diffusion Transformers 🔥
☆696Nov 28, 2024Updated last year
NJU-PCALab / InstanceCap
View on GitHub
[CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍
☆45Jul 5, 2025Updated last year
NJU-PCALab / CoDi
View on GitHub
CoDi:Subject-Consistent and Pose-Diverse Text-to-Image Generation
☆36Aug 1, 2025Updated 11 months ago
ali-vilab / In-Context-LoRA
View on GitHub
Official repository of In-Context LoRA for Diffusion Transformers
☆2,078Dec 20, 2024Updated last year
Yuanshi9815 / OminiControl
View on GitHub
[ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer
☆1,925Jul 2, 2026Updated 2 weeks ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
NJU-PCALab / L2P
View on GitHub
L2P: Unlocking Latent Potential for Pixel Generation
☆39May 22, 2026Updated last month
ZNan-Chen / Awesome-Visual-Autoregressive-Model
View on GitHub
Latest Advances on Autoregressive Visual Models.📖
☆28Mar 15, 2025Updated last year
NJU-PCALab / TextCrafter
View on GitHub
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes
☆97Nov 26, 2025Updated 7 months ago
JackAILab / ConsistentID
View on GitHub
[TPAMI 2026] ConsistentID : Portrait Generation with Multimodal Fine-Grained Identity Preserving
☆1,027Jan 2, 2026Updated 6 months ago
TencentYoutuResearch / T2I-L2P
View on GitHub
Code for "L2P: Unlocking Latent Potential for Pixel Generation"
☆179Jul 11, 2026Updated last week
wangjiangshan0725 / RF-Solver-Edit
View on GitHub
[🚀ICML 2025] "Taming Rectified Flow for Inversion and Editing" Using FLUX and HunyuanVideo for image and video editing!
☆637May 1, 2025Updated last year
jtun-coder / JtunRouter
View on GitHub
It is an Android-based application that enables managing hotspot properties through a web interface, providing mobile routing functionali…
☆156Jul 14, 2026Updated last week
530051970 / auth-hub-demo
View on GitHub
User Identity Scaffolding for Multiple OIDC Authentications for User
☆95Jun 14, 2025Updated last year
fallenshock / FlowEdit
View on GitHub
Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"
☆1,009May 27, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
YangLing0818 / RPG-DiffusionMaster
View on GitHub
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
☆1,838Feb 1, 2025Updated last year
PKU-YuanGroup / ConsisID
View on GitHub
[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition
☆848Apr 14, 2026Updated 3 months ago
NJU-PCALab / UltraHR-100k
View on GitHub
This is the official repository of UltraHR-100K.
☆45Nov 21, 2025Updated 8 months ago
TencentQQGYLab / ELLA
View on GitHub
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
☆1,285Jul 17, 2024Updated 2 years ago
fudan-generative-vision / hallo3
View on GitHub
[CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer
☆1,394Mar 13, 2025Updated last year
TencentARC / BrushNet
View on GitHub
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
☆1,737Dec 17, 2024Updated last year
Rhythm-Byte / SchemaDiff
View on GitHub
☆246Nov 24, 2024Updated last year
ali-vilab / UniAnimate
View on GitHub
Code for SCIS-2025 Paper "UniAnimate: Taming Unified Video Diﬀusion Models for Consistent Human Image Animation".
☆1,190Apr 15, 2025Updated last year
shenjunjiekoda / knight
View on GitHub
kight is a static analysis tool for c/c++ programs.
☆213Dec 27, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ali-vilab / Cones-V2
View on GitHub
[ICML 2023 Oral, NeurIPS 2023] Official implementations for paper: Customizable Image Synthesis with Multiple Subjects
☆446Sep 12, 2023Updated 2 years ago
bytedance / UNO
View on GitHub
[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning
☆1,359Sep 12, 2025Updated 10 months ago
fenghora / personalize-anything
View on GitHub
[AAAI 2026] Personalize Anything for Free with Diffusion Transformer
☆361Mar 26, 2026Updated 3 months ago
MingXiangL / DEVIL
View on GitHub
Evaluation of Text-to-Video Generation Models: A Dynamics Perspective[NeurIPS 2024].
☆274Dec 3, 2024Updated last year
360CVGroup / Qihoo-T2X
View on GitHub
Efficient DiT architecture for text2any tasks, ICLR2025
☆446May 10, 2025Updated last year
Falling-dow / Unsupervised-Image-Enhancement-with-CNN-and-GAN
View on GitHub
Advanced Unsupervised Image Enhancement with GAN
☆247Nov 11, 2024Updated last year
MingXiangL / AttentionShift
View on GitHub
Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation
☆155Oct 18, 2024Updated last year
dongxuyue / Open-ReplaceAnything
View on GitHub
Unofficial Implementation of ReplaceAnything: https://aigcdesigngroup.github.io/replace-anything/
☆399May 27, 2024Updated 2 years ago
rhymes-ai / Allegro
View on GitHub
Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te…
☆1,133Feb 7, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
SheldongChen / CLaM
View on GitHub
An open-source library with a powerful Contrastive Language-and-Motion (CLaM) pre-training evaluator
☆99Nov 23, 2025Updated 7 months ago
yileijin / Bootstrap-GS
View on GitHub
☆251Feb 11, 2025Updated last year
SiyangLi99 / open-alteryx-macro
View on GitHub
Welcome to the 'Open-Alteryx-Macro' project. This project is aimed at providing an open-source solution for managing and updating Alteryx…
☆156May 25, 2024Updated 2 years ago
bcmi / Awesome-Object-Insertion
View on GitHub
A curated list of papers, code and resources pertaining to image composition/compositing or object/subject insertion/addition/compositing…
☆540Apr 30, 2026Updated 2 months ago
NJU-PCALab / STTrack
View on GitHub
[AAAI 2025] Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking
☆118May 18, 2025Updated last year
fudan-generative-vision / hallo2
View on GitHub
[ICLR 2025] Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
☆3,722Feb 27, 2025Updated last year
YUCHEN005 / STAR-Adapt
View on GitHub
Code for paper "Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models"
☆241May 24, 2024Updated 2 years ago