muzishen / RCDMs
[AAAI 2025] Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models. RCDMs improve story generation with strong semantic and temporal consistency, integrating rich contextual conditions and enabling one-pass inference for enhanced coherence.
☆35Updated 4 months ago
Alternatives and similar repositories for RCDMs
Users who are interested in RCDMs are comparing it to the libraries listed below
- An official implementation of "Re-Attentional Controllable Video Diffusion Editing" in PyTorch. (AAAI 2025)☆26Updated 4 months ago
- [CVPR 2024] U-VAP: User-specified Visual Appearance Personalization via Decoupled Self Augmentation☆17Updated 8 months ago
- [NeurIPS 2024] IMAGPose: A Unified Conditional Framework for Pose-Guided Person Generation. IMAGPose enables versatile pose-guided image …☆44Updated 4 months ago
- [ICLR2025] ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance"☆41Updated 2 months ago
- Official implementation of the paper "Attentive Eraser: Unleashing Diffusion Model’s Object Removal Potential via Self-Attention Redirect…☆151Updated this week
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍☆40Updated last month
- [ICLR 24] MaGIC: Multi-modality Guided Image Completion☆49Updated last year
- ☆32Updated 6 months ago
- EEdit⚡: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing☆22Updated last month
- This repo contains the code for PreciseControl project [ECCV'24]☆62Updated 7 months ago
- Official implementation of Improving Text-guided Object Inpainting with Semantic Pre-inpainting in ECCV 2024☆54Updated 5 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆57Updated 2 months ago
- ☆41Updated 4 months ago
- The code of Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks☆21Updated last year
- EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM☆55Updated last month
- ☆29Updated last month
- ☆14Updated last year
- [CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation☆29Updated 2 months ago
- Official Implementation for "Unified Diffusion-Based Rigid and Non-Rigid Editing with Text and Image Guidance"☆18Updated last year
- [ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback☆49Updated 6 months ago
- Official implementation of Faceptor: A Generalist Model for Face Perception.☆44Updated 9 months ago
- Official implementation for the CVPR 2024 paper CAMEL☆18Updated 10 months ago
- [ECCV2024] StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion☆38Updated 10 months ago
- Compositional Inversion for Stable Diffusion Models (AAAI 2024)☆35Updated 2 months ago
- CVPR-24 | Official codebase for ZONE: Zero-shot InstructiON-guided Local Editing☆78Updated 5 months ago
- The official implementation of "2025 ICLR Dynamic Diffusion Transformer" and "2025 arXiv DyDiT++: Dynamic Diffusion Transformers for Efficie…☆33Updated last month
- The code of Edit-Your-Motion☆13Updated last year
- ☆19Updated 5 months ago
- [CVPR2025] Official implementation of High Fidelity Scene Text Synthesis.☆62Updated last month
- Official repository of the paper InstructBrush: Learning Attention-based Instruction Optimization for Image Editing☆14Updated last year