ant-research / MagicQuillLinks
[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
☆3,463Updated 2 months ago
Alternatives and similar repositories for MagicQuill
Users that are interested in MagicQuill are comparing it to the libraries listed below
Sorting:
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆4,300Updated 3 weeks ago
- Official repository for LTX-Video☆6,745Updated last month
- OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340☆4,169Updated last week
- Official implementations for paper: VACE: All-in-One Video Creation and Editing☆2,717Updated last month
- LTX-Video Support for ComfyUI☆2,080Updated last week
- Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persisten…☆1,762Updated last month
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling☆2,979Updated 6 months ago
- ☆2,227Updated last week
- A minimal and universal controller for FLUX.1.☆1,649Updated 3 weeks ago
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment☆1,221Updated 3 weeks ago
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo☆1,518Updated last month
- SkyReels V1: The first and most advanced open-source human-centric video foundation model☆2,215Updated 3 months ago
- ACE-Step: A Step Towards Music Generation Foundation Model☆2,570Updated 3 weeks ago
- Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"☆1,564Updated last month
- A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem…☆1,450Updated this week
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation☆1,083Updated 2 weeks ago
- OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs), capable of g…☆1,794Updated last month
- [CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/☆2,857Updated 4 months ago
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment☆3,396Updated last month
- Official repository of In-Context LoRA for Diffusion Transformers☆1,919Updated 6 months ago
- The best OSS video generation models☆3,231Updated 5 months ago
- Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion☆1,729Updated last month
- [CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis☆1,638Updated last month
- Implementation of [CVPR 2025] "DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation"☆818Updated 4 months ago
- ☆994Updated last month
- 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning☆1,135Updated 2 months ago
- Open-source unified multimodal model☆4,309Updated last week
- MAGI-1: Autoregressive Video Generation at Scale☆3,302Updated last week
- ☆1,457Updated last week
- Kolors Team☆4,467Updated 7 months ago