[IJCV 2025] Paragraph-to-Image Generation with Information-Enriched Diffusion Model
☆106Mar 24, 2025Updated 11 months ago
Alternatives and similar repositories for ParaDiffusion
Users that are interested in ParaDiffusion are comparing it to the libraries listed below
Sorting:
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆81Apr 10, 2024Updated last year
- Edit and Generate Anything in 3D world!☆14Apr 15, 2023Updated 2 years ago
- ☆83Aug 1, 2023Updated 2 years ago
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort☆149Nov 23, 2024Updated last year
- Code for [CVPR 2025] ROICtrl: Boosting Instance Control for Visual Generation☆111Apr 16, 2025Updated 10 months ago
- NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models☆428May 14, 2024Updated last year
- [ECCV 2022] AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant☆23Jan 30, 2026Updated last month
- [ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation☆298Jul 17, 2024Updated last year
- [ICCV 2023] Label-Efficient Online Continual Object Detection in Streaming Video☆23Jan 8, 2024Updated 2 years ago
- [ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing"☆88Aug 23, 2024Updated last year
- ☆73May 10, 2024Updated last year
- The code repository of UniRL☆51May 30, 2025Updated 9 months ago
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance".☆42May 24, 2023Updated 2 years ago
- torch_quantizer is a out-of-box quantization tool for PyTorch models on CUDA backend, specially optimized for Diffusion Models.☆23Mar 29, 2024Updated last year
- (AAAI'25) Training-and-pormpt Free General Painterly Image Harmonization Using image-wise attention sharing☆60Dec 17, 2024Updated last year
- The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs"☆73Nov 21, 2024Updated last year
- [NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos☆146Dec 26, 2024Updated last year
- ☆238Apr 10, 2024Updated last year
- [CVPR 2025] DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles☆30May 13, 2025Updated 9 months ago
- Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation☆16Mar 10, 2024Updated last year
- Unofficial implementation of Face0 with SDXL☆12Sep 1, 2023Updated 2 years ago
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆16Jun 20, 2023Updated 2 years ago
- [CVPR2023] All in One: Exploring Unified Video-Language Pre-training☆281Mar 25, 2023Updated 2 years ago
- ☆14Oct 16, 2023Updated 2 years ago
- LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation☆504Nov 16, 2024Updated last year
- [ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation☆506Jul 2, 2024Updated last year
- DiverGen (CVPR 2024) & BSGAL (ICML 2024)☆53Jul 6, 2025Updated 7 months ago
- [ECCV 2024 Oral] ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction☆77Aug 13, 2024Updated last year
- Pytorch Implementation of "UNIST: Unpaired Neural Implicit Shape Translation Network", CVPR 2022☆17Apr 29, 2023Updated 2 years ago
- Repo of HawkLlama.☆16Jan 2, 2025Updated last year
- Glance: Accelerating Diffusion Models with 1 Sample☆152Dec 24, 2025Updated 2 months ago
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision☆136Dec 21, 2024Updated last year
- CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)☆349Jul 26, 2024Updated last year
- A Large-scale Dataset for training and evaluating model's ability on Dense Text Image Generation☆86Sep 27, 2025Updated 5 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆309Mar 12, 2025Updated 11 months ago
- ☆39Dec 8, 2023Updated 2 years ago
- ☆580Dec 21, 2024Updated last year
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)☆41Mar 23, 2024Updated last year
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment☆1,280Jul 17, 2024Updated last year