muzishen / RCDMs
[AAAI 2025] Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models. RCDMs improve story generation with strong semantic and temporal consistency, integrating rich contextual conditions and enabling one-pass inference for enhanced coherence.
☆35Updated 3 months ago
Alternatives and similar repositories for RCDMs:
Users that are interested in RCDMs are comparing it to the libraries listed below
- An official implementation of "Re-Attentional Controllable Video Diffusion Editing" in PyTorch. (AAAI 2025)☆28Updated 3 months ago
- [NeurIPS 2024] IMAGPose: A Unified Conditional Framework for Pose-Guided Person Generation. IMAGPose enables versatile pose-guided image …☆38Updated 3 months ago
- Official implementation of the paper "Attentive Eraser: Unleashing Diffusion Model’s Object Removal Potential via Self-Attention Redirect…☆134Updated last month
- Official implementation of ImprovingText-guided ObjectInpainting with SemanticPre-inpainting in ECCV 2024☆49Updated 3 months ago
- [CVPR 2024] U-VAP: User-specified Visual Appearance Personalization via Decoupled Self Augmentation☆17Updated 6 months ago
- [ICLR2025] ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance"☆40Updated 2 weeks ago
- [ICLR 24] MaGIC: Multi-modality Guided Image Completion☆49Updated 11 months ago
- [AAAI 2025] CoPEFT: Fast Adaptation Framework for Multi-Agent Collaborative Perception with Parameter-Efficient Fine-Tuning☆17Updated 3 months ago
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models☆22Updated 4 months ago
- ☆27Updated 4 months ago
- EEdit⚡: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing☆16Updated last week
- Official implementation for the CVPR 2024 paper CAMEL☆17Updated 9 months ago
- [CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models☆66Updated 11 months ago
- [AAAI 2025] Official implementation of the paper "EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation"☆20Updated 3 months ago
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍☆35Updated last month
- ☆39Updated last year
- ☆17Updated last month
- ☆37Updated 2 months ago
- Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations"☆21Updated 2 weeks ago
- ☆27Updated 5 months ago
- [NeurIPS2024] Overcome hallucination of diffusion restoration models.☆35Updated 2 months ago
- ☆18Updated 4 months ago
- Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation☆16Updated last year
- Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editing☆24Updated 3 months ago
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)☆37Updated 6 months ago
- [ECCV 2024] Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning☆46Updated last month
- [CVPR2024] Official implementation of High-fidelity Person-centric Subject-to-Image Synthesis.☆51Updated last month
- [CVPR 2024] SimDA: Simple Diffusion Adapter for Efficient Video Generation☆127Updated 10 months ago
- This repo contains the code for PreciseControl project [ECCV'24]☆57Updated 5 months ago
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation☆30Updated 3 months ago