Nithin-GK / MaxFusion
[ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models
☆22Updated 4 months ago
Alternatives and similar repositories for MaxFusion:
Users that are interested in MaxFusion are comparing it to the libraries listed below
- Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation☆16Updated last year
- FreeCond: A Free Lunch for Input Conditions in Text-Guided Inpainting. FreeCond introduces a more generalized form💪 of the original inpa…☆12Updated 3 months ago
- [ICLR 24] MaGIC: Multi-modality Guided Image Completion☆49Updated 11 months ago
- ☆16Updated last year
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)☆37Updated 6 months ago
- ☆27Updated last week
- [CVPR '23] Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models☆36Updated last year
- [CVPR 2024] U-VAP: User-specified Visual Appearance Personalization via Decoupled Self Augmentation☆17Updated 7 months ago
- ☆20Updated last year
- [ICLR2025] ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance"☆40Updated 3 weeks ago
- ☆15Updated last year
- LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models☆52Updated 7 months ago
- The official code of "Image is All You Need to Empower Large-scale Diffusion Models for In-Domain Generation". [CVPR2025]☆18Updated 2 weeks ago
- code for paper "Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models"☆42Updated last year
- Video Diffusion Transformers are In-Context Learners☆20Updated 2 months ago
- Official Implementations "Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models" (ICLR2024)☆47Updated 3 months ago
- An innovative method designed to augment the capabilities of existing video diffusion models☆22Updated 10 months ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆26Updated 11 months ago
- [CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization☆56Updated 9 months ago
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"☆44Updated 3 months ago
- Video Diffusion State Space Models☆19Updated last year
- The code of Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks☆21Updated 11 months ago
- Officail Implementation for "Unified Diffusion-Based Rigid and Non-Rigid Editing with Text and Image Guidance"☆18Updated last year
- ☆36Updated last month
- [ECCV2024] Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation☆67Updated last week
- The official repository of "Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models".☆48Updated last year
- [CVPR 2024 Highlight] PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis☆39Updated last year
- ☆13Updated last week
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.☆75Updated last year
- [ECCV2024] Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models☆41Updated 8 months ago