[ICLR2025] A versatile image-to-image visual assistant, designed for image generation, manipulation, and translation based on free-from user instructions.
☆210May 5, 2025Updated 9 months ago
Alternatives and similar repositories for PixWizard
Users that are interested in PixWizard are comparing it to the libraries listed below
Sorting:
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]☆141Jan 27, 2025Updated last year
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark☆28Apr 22, 2025Updated 10 months ago
- Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control☆191Dec 31, 2024Updated last year
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini…☆638Oct 16, 2025Updated 4 months ago
- [AAAI 2026] Personalize Anything for Free with Diffusion Transformer☆355Mar 20, 2025Updated 11 months ago
- [ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models☆322Apr 24, 2025Updated 10 months ago
- Official code of "MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation"☆204Apr 1, 2025Updated 11 months ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,936Aug 15, 2024Updated last year
- The official implementation of paper: DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting☆121Jan 2, 2025Updated last year
- Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model☆240May 5, 2025Updated 9 months ago
- Official implementation of "Single Image Iterative Subject-driven Generation and Editing".☆100May 30, 2025Updated 9 months ago
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM☆72Jul 16, 2025Updated 7 months ago
- Scripts to teach Flux the task of image editing from language with the Flux Control framework.☆99Jun 30, 2025Updated 8 months ago
- Official implementation of OneDiffusion paper (CVPR 2025)☆664Dec 14, 2024Updated last year
- [ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer☆1,903Jul 3, 2025Updated 7 months ago
- [ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generation☆148Oct 9, 2025Updated 4 months ago
- SEED-Voken: A Series of Powerful Visual Tokenizers☆996Nov 25, 2025Updated 3 months ago
- Next-Token Prediction is All You Need☆2,355Jan 12, 2026Updated last month
- Multimodal Models in Real World☆555Feb 24, 2025Updated last year
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing☆164Jun 26, 2025Updated 8 months ago
- ☆271Jul 23, 2024Updated last year
- [🚀ICML 2025] "Taming Rectified Flow for Inversion and Editing" Using FLUX and HunyuanVideo for image and video editing!☆617May 1, 2025Updated 10 months ago
- Code release for our NeurIPS 2024 Spotlight paper "GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing"☆160Oct 23, 2024Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,252Feb 16, 2025Updated last year
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.☆1,875Jan 8, 2026Updated last month
- Training-free Regional Prompting for Diffusion Transformers 🔥☆694Nov 28, 2024Updated last year
- JoPano: Unified Panorama Generation via Joint Modeling☆23Dec 12, 2025Updated 2 months ago
- ☆110Jul 9, 2024Updated last year
- Official Pytorch Implementation of Paper - Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation - NeurIPS 2024☆113Dec 23, 2024Updated last year
- Empowering Unified MLLM with Multi-granular Visual Generation☆129Jan 16, 2025Updated last year
- [ICCV 2025] Official implementation for KV-Edit: Training-Free Image Editing for Precise Background Preservation☆372May 21, 2025Updated 9 months ago
- [ICCV 25] OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting☆315Oct 23, 2025Updated 4 months ago
- [ICLR 2025] Codebase for "CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation"☆262Jan 12, 2026Updated last month
- Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"☆310Sep 28, 2025Updated 5 months ago
- [CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition☆828Aug 30, 2025Updated 6 months ago
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆650Oct 16, 2024Updated last year
- PICABench: How Far Are We from Physically Realistic Image Editing?☆36Nov 5, 2025Updated 3 months ago
- MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models (NeurIPS 2024)☆97Jan 17, 2025Updated last year
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆128Nov 29, 2024Updated last year