weijiawu/ParaDiffusion


weijiawu / ParaDiffusion

[IJCV 2025] Paragraph-to-Image Generation with Information-Enriched Diffusion Model

☆107

Alternatives and similar repositories for ParaDiffusion

Users who are interested in ParaDiffusion are comparing it to the libraries listed below.

showlab / Show-Anything-3D
Edit and Generate Anything in 3D world!
☆13 · Apr 15, 2023 · Updated 3 years ago
showlab / T2VScore
T2VScore: Towards A Better Metric for Text-to-Video Generation
☆81 · Apr 10, 2024 · Updated 2 years ago
showlab / ShowAnything
☆83 · Aug 1, 2023 · Updated 2 years ago
ThisisBillhe / torch_quantizer
torch_quantizer is an out-of-the-box quantization tool for PyTorch models on the CUDA backend, specially optimized for diffusion models.
☆25 · Mar 29, 2024 · Updated 2 years ago
showlab / Efficient-CLS
[ICCV 2023] Label-Efficient Online Continual Object Detection in Streaming Video
☆23 · Jan 8, 2024 · Updated 2 years ago
showlab / Q2A
[ECCV 2022] AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant
☆23 · Jan 30, 2026 · Updated 5 months ago
showlab / ROICtrl
Code for [CVPR 2025] ROICtrl: Boosting Instance Control for Visual Generation
☆110 · Apr 16, 2025 · Updated last year
ShihaoZhaoZSH / LaVi-Bridge
[ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation
☆300 · Jul 17, 2024 · Updated 2 years ago
showlab / cosmo
☆75 · May 10, 2024 · Updated 2 years ago
bryandlee / face0-sdxl
Unofficial implementation of Face0 with SDXL
☆12 · Sep 1, 2023 · Updated 2 years ago
aim-uofa / AutoStory
[IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort
☆149 · Mar 5, 2026 · Updated 4 months ago
TencentARC / Mix-of-Show
NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
☆427 · May 14, 2024 · Updated 2 years ago
aim-uofa / VLModel
Repo of HawkLlama.
☆16 · Jan 2, 2025 · Updated last year
showlab / VideoLISA
[NeurIPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos
☆148 · Dec 26, 2024 · Updated last year
showlab / DoraCycle
DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles
☆31 · Mar 8, 2026 · Updated 4 months ago
aim-uofa / OIR
[ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing"
☆87 · Aug 23, 2024 · Updated last year
showlab / CLVQA
[AAAI 2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)
☆42 · Mar 23, 2024 · Updated 2 years ago
aim-uofa / COSINE
[ICCV'25] Unified Open-World Segmentation with Multi-Modal Prompts
☆16 · Jun 16, 2026 · Updated last month
nipunjindal / diffusers-layout-guidance
🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance".
☆42 · May 24, 2023 · Updated 3 years ago
LgQu / DPT-T2I
Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation
☆33 · Mar 30, 2025 · Updated last year
video-reality-test / video-reality-test
☆23 · May 5, 2026 · Updated 2 months ago
TencentARC / TaCA
Official code for the paper "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".
☆16 · Jun 20, 2023 · Updated 3 years ago
ThisisBillhe / ZipCache
[NeurIPS 2024] The official implementation of ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification
☆33 · Mar 30, 2025 · Updated last year
ilur98 / DGQ
Official Code For Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM
☆14 · Dec 27, 2023 · Updated 2 years ago
weijiawu / TransDETR
[IJCV 2024] TransDETR: End-to-end Video Text Spotting with Transformer
☆114 · Mar 28, 2024 · Updated 2 years ago
aim-uofa / DiverGen
DiverGen (CVPR 2024) & BSGAL (ICML 2024)
☆53 · Jul 6, 2025 · Updated last year
aim-uofa / GSI-Bench
[CVPR 2026] Exploring Spatial Intelligence from a Generative Perspective
☆30 · Jun 3, 2026 · Updated last month
aim-uofa / GenDeF
☆39 · Mar 5, 2026 · Updated 4 months ago
QingZhong1996 / Awesome-Video-Instance-Segmentation-Papers
☆36 · Oct 21, 2022 · Updated 3 years ago
ali-vilab / Ranni
☆237 · Apr 10, 2024 · Updated 2 years ago
showlab / DragAnything
[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation
☆506 · Jul 2, 2024 · Updated 2 years ago
qiminchen / UNIST
PyTorch Implementation of "UNIST: Unpaired Neural Implicit Shape Translation Network", CVPR 2022
☆17 · Apr 29, 2023 · Updated 3 years ago
mlpc-ucsd / TokenCompose
(CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision
☆137 · Dec 21, 2024 · Updated last year
junjie-shentu / Textual-Localization
Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation
☆16 · Mar 10, 2024 · Updated 2 years ago
CSU-JPG / TextAtlas
[ICML 2026] A Large-scale Dataset for training and evaluating models' ability on Dense Text Image Generation
☆93 · Sep 27, 2025 · Updated 9 months ago
showlab / Exo2Ego-V
☆61 · Apr 28, 2025 · Updated last year
showlab / FQGAN
FQGAN: Factorized Visual Tokenization and Generation
☆59 · Mar 29, 2025 · Updated last year
showlab / MovieSeq
[ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences
☆46 · Mar 11, 2025 · Updated last year
xiangyu-mm / EasyGen
The official code for the paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs"
☆73 · Nov 21, 2024 · Updated last year