SihuiJi / FashionComposer
☆25Updated last year
Alternatives and similar repositories for FashionComposer
Users that are interested in FashionComposer are comparing it to the libraries listed below
- Blending Custom Photos with Video Diffusion Transformers☆48Updated 11 months ago
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation'☆24Updated last week
- [AAAI 2026] Official implementation of DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation☆76Updated 6 months ago
- ☆133Updated 9 months ago
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Model (Arxiv 2025)☆38Updated 6 months ago
- ☆32Updated 9 months ago
- [IJCAI 2025 (Oral)] Official implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion …☆99Updated 8 months ago
- [NeurIPS 2024] Official Implementation of Attention Interpolation of Text-to-Image Diffusion☆107Updated last year
- Controllable Animation Video Generation with Large Models-based Multimodal Agents☆221Updated this week
- VideoCoF: Unified Video Editing with Temporal Reasoner☆122Updated last week
- Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".☆118Updated 3 weeks ago
- This is the project for 'Any2Caption', Interpreting Any Condition to Caption for Controllable Video Generation☆49Updated 9 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆71Updated 5 months ago
- Official code for VINCIE: Unlocking In-context Image Editing from Video☆44Updated 4 months ago
- The official implementation of paper: DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting☆121Updated last year
- Official implementation of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects" (CVPR 2025)☆95Updated 9 months ago
- Vision Bridge Transformer at Scale☆133Updated last month
- This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehe…☆114Updated 3 months ago
- ☆53Updated last year
- ☆86Updated last year
- Finetuning and inference tools for the CogView4 and CogVideoX model series.☆110Updated 7 months ago
- [SIGGRAPH ASIA'25] BlobCtrl: Taming Controllable Blob for Element-level Image Editing☆25Updated last month
- [NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models☆109Updated 3 months ago
- MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance☆26Updated last year
- ☆91Updated 4 months ago
- TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes☆86Updated last month
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM☆71Updated 5 months ago
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆120Updated 10 months ago
- ☆29Updated 9 months ago
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆109Updated last month