Tongyi-MAI / Z-ImageLinks
☆7,867Updated this week
Alternatives and similar repositories for Z-Image
Users that are interested in Z-Image are comparing it to the libraries listed below
Sorting:
- Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.☆6,525Updated last week
- [NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ …☆2,051Updated last week
- TurboDiffusion: 100–200× Acceleration for Video Diffusion Models☆2,893Updated this week
- [ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing☆3,537Updated 2 months ago
- ☆2,488Updated 5 months ago
- HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation☆2,617Updated 2 months ago
- SkyReels V1: The first and most advanced open-source human-centric video foundation model☆2,613Updated 9 months ago
- Open-source unified multimodal model☆5,505Updated 2 months ago
- [ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models☆3,550Updated this week
- LTX-Video Support for ComfyUI☆2,461Updated 3 weeks ago
- Wan: Open and Advanced Large-Scale Video Generative Models☆13,217Updated 2 weeks ago
- OmniGen2: Exploration to Advanced Multimodal Generation. https://arxiv.org/abs/2506.18871☆3,979Updated 3 weeks ago
- Light Image Video Generation Inference Framework☆1,627Updated this week
- ☆5,767Updated this week
- The desktop app for ComfyUI (Windows & macOS)☆1,965Updated this week
- Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"(ICCV2025)☆1,708Updated 5 months ago
- CogView4, CogView3-Plus and CogView3(ECCV 2024)☆1,101Updated 9 months ago
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo☆1,758Updated 7 months ago
- Unlimited-length talking video generation that supports image-to-video and video-to-video generation☆4,078Updated last week
- [NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation☆2,752Updated last week
- Official inference repo for FLUX.2 models☆1,289Updated 3 weeks ago
- ☆1,608Updated 6 months ago
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation☆1,196Updated 2 months ago
- Qwen-Image-Layered: Layered Decomposition for Inherent Editablity☆1,030Updated this week
- A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Qwen Image, Hunyuan Video, LTX Video and Flux.☆3,598Updated last week
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆4,847Updated last week
- A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem…☆2,078Updated 2 weeks ago
- Official implementation of HYPIR: Harnessing Diffusion-Yielded Score Priors for Image Restoration (SIGGRAPH 2025)☆1,023Updated 2 months ago
- Official SeedVR2 Video Upscaler for ComfyUI☆1,697Updated last week
- MAGI-1: Autoregressive Video Generation at Scale☆3,620Updated 6 months ago