[ICLR2025] A versatile image-to-image visual assistant, designed for image generation, manipulation, and translation based on free-form user instructions.
☆209 · May 5, 2025 · Updated last year
Alternatives and similar repositories for PixWizard
Users that are interested in PixWizard are comparing it to the libraries listed below.
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025] ☆145 · Jan 27, 2025 · Updated last year
- JoPano: Unified Panorama Generation via Joint Modeling ☆24 · Mar 6, 2026 · Updated 3 months ago
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark ☆28 · Apr 22, 2025 · Updated last year
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini… ☆647 · Oct 16, 2025 · Updated 7 months ago
- Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control ☆191 · Dec 31, 2024 · Updated last year
- [AAAI 2026] Personalize Anything for Free with Diffusion Transformer ☆361 · Mar 26, 2026 · Updated 2 months ago
- [ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models ☆325 · Apr 24, 2025 · Updated last year
- Official implementation of "Single Image Iterative Subject-driven Generation and Editing". ☆99 · May 30, 2025 · Updated last year
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation ☆1,953 · Aug 15, 2024 · Updated last year
- Official implementation of OneDiffusion paper (CVPR 2025) ☆661 · Dec 14, 2024 · Updated last year
- Official code of "MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation" ☆208 · Apr 1, 2025 · Updated last year
- The official implementation of paper: DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting ☆121 · Jan 2, 2025 · Updated last year
- Next-Token Prediction is All You Need ☆2,417 · Jan 12, 2026 · Updated 4 months ago
- Multimodal Models in Real World ☆558 · Feb 24, 2025 · Updated last year
- [ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer ☆1,913 · Jul 3, 2025 · Updated 11 months ago
- Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model ☆239 · May 5, 2025 · Updated last year
- ☆45 · Jan 4, 2026 · Updated 5 months ago
- Scripts to teach Flux the task of image editing from language with the Flux Control framework. ☆101 · Jun 30, 2025 · Updated 11 months ago
- CAR: Controllable AutoRegressive Modeling for Visual Generation ☆129 · Nov 29, 2024 · Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generation ☆2,251 · Feb 16, 2025 · Updated last year
- SEED-Voken: A Series of Powerful Visual Tokenizers ☆1,011 · Nov 25, 2025 · Updated 6 months ago
- ☆272 · Jul 23, 2024 · Updated last year
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM ☆72 · Jul 16, 2025 · Updated 10 months ago
- Official PyTorch Implementation of Paper - Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation - NeurIPS 2024 ☆111 · Dec 23, 2024 · Updated last year
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing ☆165 · Jun 26, 2025 · Updated 11 months ago
- ☆112 · Jul 9, 2024 · Updated last year
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation. ☆1,944 · Jan 8, 2026 · Updated 5 months ago
- [ICLR 2025] Codebase for "CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation" ☆268 · Mar 6, 2026 · Updated 3 months ago
- Empowering Unified MLLM with Multi-granular Visual Generation ☆132 · Jan 16, 2025 · Updated last year
- [ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generation ☆149 · Oct 9, 2025 · Updated 8 months ago
- [🚀ICML 2025] "Taming Rectified Flow for Inversion and Editing" Using FLUX and HunyuanVideo for image and video editing! ☆630 · May 1, 2025 · Updated last year
- OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340 ☆4,326 · Dec 4, 2025 · Updated 6 months ago
- Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing" ☆315 · Sep 28, 2025 · Updated 8 months ago
- [ICCV 25] OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting ☆325 · Mar 27, 2026 · Updated 2 months ago
- [CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation ☆865 · Mar 19, 2026 · Updated 2 months ago
- [CVPR 2025 Oral] Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis ☆1,570 · Apr 16, 2026 · Updated last month
- PICABench: How Far Are We from Physically Realistic Image Editing?