GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation.
☆806Feb 2, 2026Updated last month
Alternatives and similar repositories for GLM-Image
Users that are interested in GLM-Image are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL☆2,045Nov 4, 2025Updated 4 months ago
- Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback☆240Jan 24, 2026Updated last month
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆670Oct 14, 2025Updated 4 months ago
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆39Jun 20, 2024Updated last year
- [NeurIPS 2025] Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulatio…☆622Oct 22, 2025Updated 4 months ago
- ☆35Nov 5, 2024Updated last year
- Official implementation of BLIP3o-Series☆1,642Nov 29, 2025Updated 3 months ago
- [ICLR2025] A versatile image-to-image visual assistant, designed for image generation, manipulation, and translation based on free-from u…☆210May 5, 2025Updated 10 months ago
- An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation☆1,535Oct 16, 2025Updated 4 months ago
- CogView4, CogView3-Plus and CogView3(ECCV 2024)☆1,105Mar 29, 2025Updated 11 months ago
- Official implementation of OneDiffusion paper (CVPR 2025)☆665Dec 14, 2024Updated last year
- Official code for ECCV 2024 paper: Learn to Optimize Denoising Scores A Unified and Improved Diffusion Prior for 3D Generation☆72Jul 11, 2024Updated last year
- HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation☆2,897Feb 3, 2026Updated last month
- ☆787Jul 17, 2025Updated 7 months ago
- Qwen-Image-Layered: Layered Decomposition for Inherent Editablity☆1,603Dec 31, 2025Updated 2 months ago
- [ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers☆463Dec 6, 2025Updated 3 months ago
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆649Oct 16, 2024Updated last year
- ☆631Mar 3, 2026Updated last week
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,937Aug 15, 2024Updated last year
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆443Aug 8, 2025Updated 7 months ago
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing☆164Jun 26, 2025Updated 8 months ago
- SEED-Voken: A Series of Powerful Visual Tokenizers☆997Nov 25, 2025Updated 3 months ago
- ☆15Nov 11, 2024Updated last year
- ComfyUI version of WithAnyone☆24Dec 18, 2025Updated 2 months ago
- [CVPR 2026 (Findings) 🔥🔥] Self Evolving Large Multimodal Models with Continuous Rewards☆20Updated this week
- [ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think☆1,560Mar 16, 2025Updated 11 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,253Feb 16, 2025Updated last year
- The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" [ICLR 2024 Spotlight]☆314Apr 29, 2024Updated last year
- Next-Token Prediction is All You Need☆2,367Jan 12, 2026Updated last month
- Ming - facilitating advanced multimodal understanding and generation capabilities built upon the Ling LLM.☆636Feb 12, 2026Updated 3 weeks ago
- [AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation☆383Mar 26, 2025Updated 11 months ago
- [ICLR 2026] Official implementation of DiCache: Let Diffusion Model Determine Its Own Cache☆55Jan 26, 2026Updated last month
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding☆513Nov 14, 2025Updated 3 months ago
- ☆2,498Jul 16, 2025Updated 7 months ago
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching☆168Nov 18, 2024Updated last year
- implementations and experimentation on mHC by deepseek - https://arxiv.org/abs/2512.24880☆322Feb 17, 2026Updated 3 weeks ago
- Towards Scalable Pre-training of Visual Tokenizers for Generation☆449Dec 16, 2025Updated 2 months ago
- ☆3,177Mar 17, 2025Updated 11 months ago
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini…☆640Oct 16, 2025Updated 4 months ago