[ICLR2025] A versatile image-to-image visual assistant, designed for image generation, manipulation, and translation based on free-form user instructions.
☆210 May 5, 2025 Updated 10 months ago
Alternatives and similar repositories for PixWizard
Users that are interested in PixWizard are comparing it to the libraries listed below
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025] ☆142 Jan 27, 2025 Updated last year
- JoPano: Unified Panorama Generation via Joint Modeling ☆24 Mar 6, 2026 Updated 2 weeks ago
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark ☆28 Apr 22, 2025 Updated 10 months ago
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini… ☆643 Oct 16, 2025 Updated 5 months ago
- Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control ☆191 Dec 31, 2024 Updated last year
- [AAAI 2026] Personalize Anything for Free with Diffusion Transformer ☆357 Mar 20, 2025 Updated last year
- [ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models ☆324 Apr 24, 2025 Updated 10 months ago
- Official implementation of "Single Image Iterative Subject-driven Generation and Editing". ☆100 May 30, 2025 Updated 9 months ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation ☆1,941 Aug 15, 2024 Updated last year
- Official implementation of OneDiffusion paper (CVPR 2025) ☆665 Dec 14, 2024 Updated last year
- Official code of "MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation" ☆205 Apr 1, 2025 Updated 11 months ago
- The official implementation of paper: DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting ☆121 Jan 2, 2025 Updated last year
- Next-Token Prediction is All You Need ☆2,374 Jan 12, 2026 Updated 2 months ago
- ☆41 Jan 4, 2026 Updated 2 months ago
- Multimodal Models in Real World ☆558 Feb 24, 2025 Updated last year
- [ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer ☆1,905 Jul 3, 2025 Updated 8 months ago
- Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model ☆240 May 5, 2025 Updated 10 months ago
- Scripts to teach Flux the task of image editing from language with the Flux Control framework. ☆99 Jun 30, 2025 Updated 8 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation ☆2,253 Feb 16, 2025 Updated last year
- CAR: Controllable AutoRegressive Modeling for Visual Generation ☆129 Nov 29, 2024 Updated last year
- SEED-Voken: A Series of Powerful Visual Tokenizers ☆998 Nov 25, 2025 Updated 3 months ago
- ☆271 Jul 23, 2024 Updated last year
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM ☆71 Jul 16, 2025 Updated 8 months ago
- Official Pytorch Implementation of Paper - Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation - NeurIPS 2024 ☆112 Dec 23, 2024 Updated last year
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing ☆164 Jun 26, 2025 Updated 8 months ago
- ☆111 Jul 9, 2024 Updated last year
- Empowering Unified MLLM with Multi-granular Visual Generation ☆130 Jan 16, 2025 Updated last year
- [ICLR 2025] Codebase for "CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation" ☆263 Mar 6, 2026 Updated 2 weeks ago
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation. ☆1,895 Jan 8, 2026 Updated 2 months ago
- [ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generation ☆149 Oct 9, 2025 Updated 5 months ago
- [🚀ICML 2025] "Taming Rectified Flow for Inversion and Editing" Using FLUX and HunyuanVideo for image and video editing! ☆619 May 1, 2025 Updated 10 months ago
- Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing" ☆312 Sep 28, 2025 Updated 5 months ago
- OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340 ☆4,313 Dec 4, 2025 Updated 3 months ago
- [ICCV 25] OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting ☆316 Oct 23, 2025 Updated 4 months ago
- [CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation ☆860 Updated this week
- [CVPR 2025 Oral] Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis ☆1,553 Nov 10, 2025 Updated 4 months ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition ☆176 Sep 1, 2025 Updated 6 months ago
- PICABench: How Far Are We from Physically Realistic Image Editing? ☆36 Nov 5, 2025 Updated 4 months ago
- Code release for our NeurIPS 2024 Spotlight paper "GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing" ☆164 Oct 23, 2024 Updated last year