[IJCV 2025] Paragraph-to-Image Generation with Information-Enriched Diffusion Model
☆107Mar 24, 2025Updated last year
Alternatives and similar repositories for ParaDiffusion
Users that are interested in ParaDiffusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Edit and Generate Anything in 3D world!☆14Apr 15, 2023Updated 2 years ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆81Apr 10, 2024Updated last year
- ☆83Aug 1, 2023Updated 2 years ago
- torch_quantizer is a out-of-box quantization tool for PyTorch models on CUDA backend, specially optimized for Diffusion Models.☆25Mar 29, 2024Updated last year
- [ICCV 2023] Label-Efficient Online Continual Object Detection in Streaming Video☆23Jan 8, 2024Updated 2 years ago
- [ECCV 2022] AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant☆23Jan 30, 2026Updated last month
- Code for [CVPR 2025] ROICtrl: Boosting Instance Control for Visual Generation☆110Apr 16, 2025Updated 11 months ago
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort☆149Mar 5, 2026Updated 2 weeks ago
- [ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation☆299Jul 17, 2024Updated last year
- ☆74May 10, 2024Updated last year
- Unofficial implementation of Face0 with SDXL☆12Sep 1, 2023Updated 2 years ago
- NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models☆428May 14, 2024Updated last year
- Repo of HawkLlama.☆16Jan 2, 2025Updated last year
- The code repository of UniRL☆51May 30, 2025Updated 9 months ago
- [NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos☆146Dec 26, 2024Updated last year
- DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles☆32Mar 8, 2026Updated 2 weeks ago
- [ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing"☆88Aug 23, 2024Updated last year
- [ICCV2025] The official code of "DreamRelation: Relation-Centric Video Customization"☆28Feb 4, 2026Updated last month
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)☆42Mar 23, 2024Updated 2 years ago
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance".☆42May 24, 2023Updated 2 years ago
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation☆33Mar 30, 2025Updated 11 months ago
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆16Jun 20, 2023Updated 2 years ago
- ☆238Apr 10, 2024Updated last year
- DiverGen (CVPR 2024) & BSGAL (ICML 2024)☆53Jul 6, 2025Updated 8 months ago
- [NeurIPS 2024] The official implementation of ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification☆31Mar 30, 2025Updated 11 months ago
- ☆39Mar 5, 2026Updated 2 weeks ago
- [ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation☆505Jul 2, 2024Updated last year
- ☆37Oct 21, 2022Updated 3 years ago
- [IJCV 2024] TransDETR: End-to-end Video Text Spotting with Transformer☆109Mar 28, 2024Updated last year
- Pytorch Implementation of "UNIST: Unpaired Neural Implicit Shape Translation Network", CVPR 2022☆17Apr 29, 2023Updated 2 years ago
- Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation☆16Mar 10, 2024Updated 2 years ago
- [NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews)☆49Apr 14, 2025Updated 11 months ago
- (AAAI'25) Training-and-pormpt Free General Painterly Image Harmonization Using image-wise attention sharing☆61Dec 17, 2024Updated last year
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision☆136Dec 21, 2024Updated last year
- CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)☆349Jul 26, 2024Updated last year
- Glance: Accelerating Diffusion Models with 1 Sample☆152Dec 24, 2025Updated 3 months ago
- [CVPR2023] All in One: Exploring Unified Video-Language Pre-training☆281Mar 25, 2023Updated 2 years ago
- Official Code For Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM☆14Dec 27, 2023Updated 2 years ago
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences☆43Mar 11, 2025Updated last year