MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills [SIGGRAPH 2025]
☆74Jan 21, 2026Updated last month
Alternatives and similar repositories for monetGPT
Users that are interested in monetGPT are comparing it to the libraries listed below
Sorting:
- ☆43Sep 1, 2025Updated 6 months ago
- [NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models☆113Sep 27, 2025Updated 5 months ago
- [ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs.☆14Aug 8, 2025Updated 6 months ago
- [SIGGRAPH Asia 2025] "ASIA: Adaptive 3D Segmentation using Few Image Annotations ".☆23Feb 14, 2026Updated 2 weeks ago
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Jun 15, 2024Updated last year
- ☆75Dec 8, 2025Updated 2 months ago
- ☆30Jul 8, 2024Updated last year
- [CVPR 2025 满分论文 Ratings: 555]☆37May 9, 2025Updated 9 months ago
- ☆52Jan 6, 2026Updated 2 months ago
- [Unofficial Implementation] Subject-driven Video Generation via Disentangled Identity and Motion☆58Jan 5, 2026Updated 2 months ago
- Towards Pixel-Level VLM Perception via Simple Points Prediction☆92Feb 9, 2026Updated 3 weeks ago
- ☆15Mar 27, 2023Updated 2 years ago
- ☆50Updated this week
- WACV2025☆32Mar 3, 2025Updated last year
- Code for "SePPO: Semi-Policy Preference Optimization for Diffusion Alignment."☆18Oct 7, 2024Updated last year
- Code and data for the paper: AI Sees Your Location—But With A Bias Toward The Wealthy World☆18Dec 15, 2025Updated 2 months ago
- [AAAI2025] Official implementation of the paper "RAP-SR: RestorAtion Prior Enhancement in Diffusion Models for Realistic Image Super-Reso…☆18Mar 22, 2025Updated 11 months ago
- [ICCV 25] Seeing and Seeing Through the Glass: Real and Synthetic Data for Multi-Layer Depth Estimation☆38Dec 16, 2025Updated 2 months ago
- Structured Noise Generation from the paper NeuralRemaster with Phase-Preserving Diffusion☆31Feb 1, 2026Updated last month
- Official Implementation for *PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling*☆32Dec 13, 2025Updated 2 months ago
- ☆16Dec 31, 2021Updated 4 years ago
- [ICLR 2026] DiMeR: Disentangled Mesh Reconstruction Model with Normal-only Geometry Training☆51May 26, 2025Updated 9 months ago
- ☆47Apr 20, 2025Updated 10 months ago
- [NeurIPS 2025 Oral] Official Code for Exploring Diffusion Transformer Designs via Grafting☆72Jan 9, 2026Updated last month
- Extract LoRA from the original Fine-Tuned model. 从微调模型中提取lora。☆20May 5, 2025Updated 10 months ago
- [arXiv] On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices☆133Nov 27, 2025Updated 3 months ago
- ☆27Jun 3, 2025Updated 9 months ago
- InstructPix2Pix with distilled diffusion models☆24Jun 30, 2024Updated last year
- [NeurIPS'25] ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and R…☆32Sep 27, 2025Updated 5 months ago
- ☆50May 27, 2023Updated 2 years ago
- ☆208May 13, 2025Updated 9 months ago
- Q-Insight is open-sourced at https://github.com/bytedance/Q-Insight. This repository will not receive further updates.☆142May 30, 2025Updated 9 months ago
- Official Repository for ICLR 2026 paper Durian: Dual Reference Image-Guided Portrait Animation with Attribute Transfer☆37Dec 8, 2025Updated 2 months ago
- ☆27Mar 3, 2025Updated last year
- AAAI2026 X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning☆95Nov 21, 2025Updated 3 months ago
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆98Feb 11, 2025Updated last year
- Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation, ECCV 2024☆22Feb 15, 2024Updated 2 years ago
- [NeurIPS 2024] COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing☆25Dec 8, 2024Updated last year
- [CVPR 2025] Official code of "PanDA: Towards Panoramic Depth Anything with Unlabeled Panoramas and Mobius Spatial Augmentation"☆41Mar 18, 2025Updated 11 months ago