meituan-longcat / LongCat-ImageLinks
☆519Updated last week
Alternatives and similar repositories for LongCat-Image
Users that are interested in LongCat-Image are comparing it to the libraries listed below
Sorting:
- An unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerfu…☆445Updated 3 weeks ago
- ☆286Updated 5 months ago
- UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation☆824Updated last week
- [NeurIPS 2025 Oral]Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation☆670Updated last month
- ☆316Updated 3 months ago
- The official code implementation of the paper "OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data."☆419Updated 6 months ago
- [ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen…☆275Updated 2 weeks ago
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆665Updated 2 months ago
- [AAAI 2026] Personalize Anything for Free with Diffusion Transformer☆351Updated 9 months ago
- Unofficial extension implementation of Self-Forcing to support I2V && 14B training.☆304Updated 3 months ago
- All-round Creator and Editor☆239Updated 2 months ago
- VideoGen-Eval: Agent-based System for Video Generation Evaluation☆253Updated 2 weeks ago
- ☆333Updated this week
- Pusa: Thousands Timesteps Video Diffusion Model☆669Updated 3 months ago
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement☆280Updated 2 months ago
- The official implementation of the paper titled "StableV2V: Stablizing Shape Consistency in Video-to-Video Editing".☆164Updated 3 weeks ago
- ☆367Updated 9 months ago
- Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control☆189Updated last year
- ☆542Updated 3 weeks ago
- AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation☆58Updated 7 months ago
- [ICCV 2025] Official implementation for KV-Edit: Training-Free Image Editing for Precise Background Preservation☆363Updated 7 months ago
- TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes☆85Updated last month
- [NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆212Updated 3 months ago
- [ICCV 25] OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting☆304Updated 2 months ago
- [ICCV 2025] Light-A-Video: Training-free Video Relighting via Progressive Light Fusion☆491Updated 2 months ago
- [ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance☆305Updated 5 months ago
- [AAAI-2026]FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation☆455Updated 9 months ago
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models☆182Updated 5 months ago
- [NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving☆260Updated 4 months ago
- [SIGGRAPH2025] Official repo for paper "Any-length Video Inpainting and Editing with Plug-and-Play Context Control"☆536Updated 8 months ago