muzishen / RCDMsLinks
[AAAI 2025] 🎬RCDMs🎬: Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models. RCDMs improve story generation with strong semantic and temporal consistency, integrating rich contextual conditions and enabling one-pass inference for enhanced coherence.
☆34Updated 3 weeks ago
Alternatives and similar repositories for RCDMs
Users that are interested in RCDMs are comparing it to the libraries listed below
Sorting:
- An official implementation of "Re-Attentional Controllable Video Diffusion Editing" in PyTorch. (AAAI 2025)☆26Updated 6 months ago
- Official implementation of ImprovingText-guided ObjectInpainting with SemanticPre-inpainting in ECCV 2024☆55Updated 6 months ago
- [CVPR 2024] U-VAP: User-specified Visual Appearance Personalization via Decoupled Self Augmentation☆17Updated 9 months ago
- [CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation☆36Updated 4 months ago
- [NeurIPS 2024] 🕺IMAGPose🕺: A Unified Conditional Framework for Pose-Guided Person Generation. IMAGPose enables versatile pose-guided im…☆51Updated 3 weeks ago
- [ICLR2025] ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance"☆42Updated 3 months ago
- [ICLR 24] MaGIC: Multi-modality Guided Image Completion☆50Updated last year
- ☆33Updated 7 months ago
- This repo contains the code for PreciseControl project [ECCV'24]☆63Updated 8 months ago
- 🧩 IMAGHarmony 🧩: Controllable image editing with consistent object quantity and layout. A structure-aware framework that ensures high f…☆18Updated 2 weeks ago
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍☆43Updated 2 months ago
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization☆103Updated last year
- Official code for K-LoRA (CVPR 2025)☆112Updated 2 weeks ago
- ☆19Updated 7 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆64Updated 2 weeks ago
- ☆39Updated last year
- Official implementation of the paper "Attentive Eraser: Unleashing Diffusion Model’s Object Removal Potential via Self-Attention Redirect…☆170Updated last month
- ☆30Updated 8 months ago
- Official PyTorch Implementation for Shape-Guided Diffusion with Inside-Outside Attention, WACV 2024☆37Updated last year
- [CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models☆68Updated last year
- Official code for CustAny: Customizing Anything from A Single Example. Accepted by CVPR2025 (Oral)☆46Updated 2 months ago
- ☆40Updated 5 months ago
- Official code for "DiffX: Guide Your Layout to Cross-Modal Generative Modeling"☆22Updated 4 months ago
- code for paper "Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models"☆42Updated last year
- ☆14Updated last year
- Official Implement of the work "Coherent and Multi-modality Image Inpainting via Latent Space Optimization"☆53Updated 2 months ago
- Official Implementations "Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models" (ICLR2024)☆51Updated 6 months ago
- ☆26Updated 3 months ago
- [TMLR] Official PyTorch implementation of "λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent…☆51Updated 6 months ago
- [CVPR2024] Official implementation of High-fidelity Person-centric Subject-to-Image Synthesis.☆54Updated 4 months ago