niladridutt / monetGPT
MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills [SIGGRAPH 2025]
☆55 Updated 3 weeks ago
Alternatives and similar repositories for monetGPT
Users who are interested in monetGPT are comparing it to the libraries listed below.
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024) ☆37 Updated last year
- Training-Free Text-Guided Image Editing Using Visual Autoregressive Model ☆60 Updated 6 months ago
- MC$^2$: Multi-concept Guidance for Customized Multi-concept Generation ☆31 Updated last year
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025] ☆129 Updated 9 months ago
- OmniStyle: Filtering High Quality Style Transfer Data at Scale (CVPR 2025) ☆31 Updated 2 months ago
- CAR: Controllable AutoRegressive Modeling for Visual Generation ☆124 Updated 11 months ago
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers ☆60 Updated last year
- [ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing" ☆88 Updated last year
- ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models ☆84 Updated last month
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing ☆69 Updated 3 months ago
- We propose to generate a series of geometric shapes with target colors to disentangle (or peel off) the target colors from the shapes. B… ☆67 Updated last year
- [NeurIPS 2025 Spotlight] VisualQuality-R1 is the first open-sourced NR-IQA model that can accurately describe and rate image quality. ☆116 Updated 2 weeks ago
- [SIGGRAPH ASIA'25] BlobCtrl: Taming Controllable Blob for Element-level Image Editing ☆21 Updated 7 months ago
- [ICCV 2025] FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing ☆64 Updated last month
- [ICLR 2024] Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach Link: https://arxiv.o… ☆83 Updated last year
- SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation (CVPR 2024) ☆68 Updated 2 months ago
- [ECCV 2024] Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models ☆87 Updated last year
- [CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models ☆75 Updated last year
- [CVPR 2025] Official implementation of StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements ☆145 Updated 2 months ago
- [CVPR 2025 Highlight] Generative Photography: Scene-Consistent Camera Control for Realistic Text-to-Image Synthesis ☆154 Updated 3 weeks ago
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection ☆46 Updated 2 months ago
- [ECCV2024] Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation ☆66 Updated 7 months ago
- Training Autoregressive Image Generation models via Reinforcement Learning ☆44 Updated 2 months ago
- Official repository of the paper InstructBrush: Learning Attention-based Instruction Optimization for Image Editing ☆15 Updated last year
- [NeurIPS2024] Overcome hallucination of diffusion restoration models. ☆55 Updated 6 months ago