mobiushy / move-act
☆11 Updated last year
Alternatives and similar repositories for move-act
Users who are interested in move-act are comparing it to the repositories listed below.
- [ICLR 24] MaGIC: Multi-modality Guided Image Completion ☆52 Updated last year
- ☆22 Updated 2 years ago
- code for paper "Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models" ☆46 Updated 2 years ago
- The official code of "Image is All You Need to Empower Large-scale Diffusion Models for In-Domain Generation". [CVPR2025] ☆20 Updated 10 months ago
- Official PyTorch Implementation for Shape-Guided Diffusion with Inside-Outside Attention, WACV 2024 ☆38 Updated 2 years ago
- FreeCond: A Free Lunch for Input Conditions in Text-Guided Inpainting. FreeCond introduces a more generalized form💪 of the original inpa… ☆15 Updated 8 months ago
- Official repository of the paper InstructBrush: Learning Attention-based Instruction Optimization for Image Editing ☆16 Updated last year
- Video Diffusion State Space Models ☆19 Updated last year
- Training Autoregressive Image Generation models via Reinforcement Learning ☆48 Updated 2 months ago
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models ☆27 Updated last year
- ☆21 Updated last year
- An innovative method designed to augment the capabilities of existing video diffusion models ☆22 Updated last year
- [CVPR '23] Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models ☆36 Updated last year
- LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models ☆66 Updated last year
- [ICLR2025] ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance" ☆46 Updated 10 months ago
- [ACCV 2024 Poster] official code for "VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model" ☆10 Updated last year
- DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models ☆46 Updated 2 years ago
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024) ☆37 Updated last year
- [CVPR2025] Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing ☆21 Updated 5 months ago
- We propose to generate a series of geometric shapes with target colors to disentangle (or peel off) the target colors from the shapes. B… ☆69 Updated last year
- Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025) ☆61 Updated last year
- [CVPR 2023] Zero-shot Generative Model Adaptation via Image-specific Prompt Learning ☆83 Updated 2 years ago
- [CVPR 2024 Highlight] PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis ☆46 Updated last year
- OmniStyle: Filtering High Quality Style Transfer Data at Scale (CVPR 2025) ☆34 Updated 5 months ago
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin… ☆35 Updated 8 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆86 Updated last year
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization ☆109 Updated last year
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection ☆55 Updated 5 months ago
- Codes of PostEdit ☆23 Updated 9 months ago
- Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation ☆16 Updated last year