muzishen / RCDMs
[AAAI 2025] Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models. RCDMs improve story generation with strong semantic and temporal consistency, integrating rich contextual conditions and enabling one-pass inference for enhanced coherence.
☆27Updated last month
Alternatives and similar repositories for RCDMs:
Users who are interested in RCDMs are comparing it to the libraries listed below:
- [CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models☆65Updated 10 months ago
- [ICLR 24] MaGIC: Multi-modality Guided Image Completion☆47Updated 9 months ago
- Official implementation of Improving Text-guided Object Inpainting with Semantic Pre-inpainting in ECCV 2024☆48Updated 2 months ago
- ☆25Updated 4 months ago
- [CVPR 2024] U-VAP: User-specified Visual Appearance Personalization via Decoupled Self Augmentation☆17Updated 5 months ago
- [ICLR2025] ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance"☆36Updated 3 weeks ago
- ☆34Updated last month
- ☆26Updated 3 months ago
- Official code for CustAny: Customizing Anything from A Single Example☆40Updated 2 months ago
- Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation".☆53Updated 5 months ago
- Official PyTorch Implementation of "Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generati…☆26Updated 11 months ago
- This repo contains the code for PreciseControl project [ECCV'24]☆54Updated 4 months ago
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models☆21Updated 3 months ago
- Official implementation of the paper "Attentive Eraser: Unleashing Diffusion Model’s Object Removal Potential via Self-Attention Redirect…☆70Updated this week
- [ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback☆41Updated 3 months ago
- [CVPR 2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization☆93Updated 10 months ago
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)☆37Updated 5 months ago
- ☆39Updated last year
- Official repo: SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing☆51Updated 10 months ago
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.☆74Updated last year
- ☆34Updated 4 months ago
- [TMLR] Official PyTorch implementation of "λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent…☆51Updated 2 months ago
- [NeurIPS 2024 D&B Track] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆66Updated last month
- CVPR-24 | Official codebase for ZONE: Zero-shot InstructiON-guided Local Editing☆72Updated 3 months ago
- ☆14Updated last year
- code for paper "Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models"☆41Updated last year
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆26Updated 9 months ago
- Video Diffusion State Space Models☆19Updated 10 months ago
- ☆39Updated 2 months ago
- Official Implementation for "Unified Diffusion-Based Rigid and Non-Rigid Editing with Text and Image Guidance"☆18Updated last year