muzishen / RCDMs
Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models. RCDMs improve story generation with strong semantic and temporal consistency, integrating rich contextual conditions and enabling one-pass inference for enhanced coherence.
☆20Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for RCDMs
- ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance"☆33Updated 4 months ago
- [CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models☆64Updated 7 months ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆26Updated 7 months ago
- Official PyTorch Implementation of "Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generati…☆24Updated 8 months ago
- [ICLR 24] MaGIC: Multi-modality Guided Image Completion☆46Updated 7 months ago
- ☆14Updated 11 months ago
- Official implementation of ImprovingText-guided ObjectInpainting with SemanticPre-inpainting in ECCV 2024☆40Updated 4 months ago
- This repo contains the code for PreciseControl project [ECCV'24]☆41Updated last month
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision"☆32Updated last week
- [ECCV2024] StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion☆34Updated 4 months ago
- Official code for CustAny: Customizing Anything from A Single Example☆38Updated this week
- Official pytorch implementation for SingleInsert☆26Updated 7 months ago
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models☆18Updated 3 weeks ago
- Eye-for-an-eye: Appearance Transfer with Semantic Correspondence in Diffusion Models☆24Updated 2 months ago
- ☆28Updated this week
- [CVPR 2024] U-VAP: User-specified Visual Appearance Personalization via Decoupled Self Augmentation☆17Updated 2 months ago
- Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023☆37Updated last year
- One-Shot Learning for Pose-Guided Person Image Synthesis in the Wild☆13Updated last month
- [ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback☆38Updated 2 weeks ago
- [ACM MM 2024] Frame Interpolation with Consecutive Brownian Bridge Diffusion Model☆26Updated this week
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers☆34Updated last month
- More suitable IP-Adapter for the DiT architecture☆26Updated 4 months ago
- [ICCV 2023] The code used in our paper "Deep Image Harmonization with Learnable Augmentation", ICCV2023.☆36Updated 6 months ago
- ☆19Updated 2 months ago
- we propose to generate a series of geometric shapes with target colors to disentangle (or peel off ) the target colors from the shapes. B…☆50Updated last month
- [ICCV 2023] Controllable Person Image Synthesis with Pose‑Constrained Latent Diffusion☆38Updated last year
- Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation".☆52Updated 2 months ago
- ☆19Updated last year
- an unofficial implementation of dreamtuner☆24Updated 9 months ago
- Officail Implementation for "Unified Diffusion-Based Rigid and Non-Rigid Editing with Text and Image Guidance"☆18Updated 10 months ago