muzishen / RCDMsLinks
[AAAI 2025] 🎬RCDMs🎬: Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models. RCDMs improve story generation with strong semantic and temporal consistency, integrating rich contextual conditions and enabling one-pass inference for enhanced coherence.
☆37Updated last week
Alternatives and similar repositories for RCDMs
Users that are interested in RCDMs are comparing it to the libraries listed below
Sorting:
- [CVPR 2024] U-VAP: User-specified Visual Appearance Personalization via Decoupled Self Augmentation☆18Updated last year
- [ICLR2025] ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance"☆46Updated 6 months ago
- ☆33Updated 11 months ago
- [CVPR2025] Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing☆21Updated last month
- ☆14Updated last year
- An official implementation of "Re-Attentional Controllable Video Diffusion Editing" in PyTorch. (AAAI 2025)☆27Updated 9 months ago
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization☆105Updated last year
- Official code for "DiffX: Guide Your Layout to Cross-Modal Generative Modeling"☆23Updated 7 months ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition☆166Updated last month
- Official implementation of ImprovingText-guided ObjectInpainting with SemanticPre-inpainting in ECCV 2024☆60Updated 9 months ago
- This repo contains the code for PreciseControl project [ECCV'24]☆68Updated last year
- ☆41Updated 8 months ago
- Official code for K-LoRA (CVPR 2025)☆125Updated last week
- [TMLR] Official PyTorch implementation of "λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent…☆51Updated 10 months ago
- ☆26Updated 5 months ago
- [ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback☆57Updated 11 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆69Updated 2 months ago
- ☆19Updated 10 months ago
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆121Updated 10 months ago
- OmniStyle: Filtering High Quality Style Transfer Data at Scale (CVPR 2025)☆29Updated last month
- code for paper "Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models"☆44Updated 2 years ago
- The code of Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks☆23Updated last year
- ☆11Updated last year
- [ICLR 24] MaGIC: Multi-modality Guided Image Completion☆51Updated last year
- [ECCV 2024] Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning☆50Updated 3 months ago
- 🧩 IMAGHarmony 🧩: Controllable image editing with consistent object quantity and layout. A structure-aware framework that ensures high f…☆24Updated last week
- Official repo for 【FaceScore: Benchmarking and Enhancing Face Quality in Human Generation】☆79Updated 9 months ago
- CVPR-24 | Official codebase for ZONE: Zero-shot InstructiON-guided Local Editing☆79Updated 10 months ago
- Official PyTorch Implementation for Shape-Guided Diffusion with Inside-Outside Attention, WACV 2024☆37Updated 2 years ago
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)☆37Updated last year