meituan-longcat / LongCat-ImageLinks
☆113Updated this week
Alternatives and similar repositories for LongCat-Image
Users that are interested in LongCat-Image are comparing it to the libraries listed below
Sorting:
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing☆163Updated 5 months ago
- ☆282Updated 4 months ago
- An unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerfu…☆442Updated last week
- Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control☆187Updated 11 months ago
- [NeurIPS 2025 Oral]Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation☆626Updated 2 weeks ago
- ☆315Updated 2 months ago
- The official code implementation of the paper "OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data."☆415Updated 6 months ago
- [NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆212Updated 2 months ago
- [ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen…☆273Updated 2 months ago
- ☆95Updated last month
- [AAAI 2026] Personalize Anything for Free with Diffusion Transformer☆351Updated 8 months ago
- UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation☆812Updated 2 weeks ago
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆661Updated last month
- [ICCV 2025] Official implementation for KV-Edit: Training-Free Image Editing for Precise Background Preservation☆359Updated 6 months ago
- RepText: Rendering Visual Text via Replicating 🔥☆139Updated 6 months ago
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset☆237Updated 3 months ago
- ☆320Updated last week
- [ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance☆302Updated 4 months ago
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement☆279Updated 2 months ago
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models☆179Updated 4 months ago
- [ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generation☆144Updated 2 months ago
- [AAAI-2026]FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation☆453Updated 9 months ago
- TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes☆83Updated 2 weeks ago
- [ICLR 2025] Official Implementation of Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image…☆337Updated this week
- [ICCV2025] DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation☆197Updated 6 months ago
- Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback☆185Updated last week
- [ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation☆203Updated 9 months ago
- ☆112Updated 7 months ago
- Official code for ICCV 2025 paper, X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distill…☆88Updated 5 months ago
- ☆268Updated last year