MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills [SIGGRAPH 2025]
☆82Jan 21, 2026Updated 2 months ago
Alternatives and similar repositories for monetGPT
Users that are interested in monetGPT are comparing it to the libraries listed below.
- [CVPR 2025 perfect-score paper, Ratings: 555]☆38May 9, 2025Updated 11 months ago
- A fully automated, intelligent photo-editing agent that autonomously plans multi-step aesthetic enhancements, smartly chooses diverse edi…☆33Mar 12, 2026Updated last month
- Structured Noise Generation from the paper NeuralRemaster with Phase-Preserving Diffusion☆36Feb 1, 2026Updated 2 months ago
- WACV2025☆32Mar 3, 2025Updated last year
- [AAAI2025] Official implementation of the paper "RAP-SR: RestorAtion Prior Enhancement in Diffusion Models for Realistic Image Super-Reso…☆18Mar 22, 2025Updated last year
- ☆43Sep 1, 2025Updated 7 months ago
- ☆76Dec 8, 2025Updated 4 months ago
- [SIGGRAPH Asia 2025] "ASIA: Adaptive 3D Segmentation using Few Image Annotations".☆26Feb 14, 2026Updated 2 months ago
- [ICCV 25] Seeing and Seeing Through the Glass: Real and Synthetic Data for Multi-Layer Depth Estimation☆39Dec 16, 2025Updated 3 months ago
- ☆30Jul 8, 2024Updated last year
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Jun 15, 2024Updated last year
- [ICLR 2026] DiMeR: Disentangled Mesh Reconstruction Model with Normal-only Geometry Training☆52May 26, 2025Updated 10 months ago
- Q-Insight is open-sourced at https://github.com/bytedance/Q-Insight. This repository will not receive further updates.☆142May 30, 2025Updated 10 months ago
- [Unofficial Implementation] Subject-driven Video Generation via Disentangled Identity and Motion☆58Jan 5, 2026Updated 3 months ago
- Official implementation of Faceptor: A Generalist Model for Face Perception.☆52Aug 12, 2024Updated last year
- [NeurIPS'25] ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and R…☆38Sep 27, 2025Updated 6 months ago
- [NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models☆116Sep 27, 2025Updated 6 months ago
- [ICCV 2025 Highlight] Bokehlicious: Photorealistic Bokeh Rendering with Controllable Apertures. Official Code and Dataset☆56Feb 4, 2026Updated 2 months ago
- Official Implementation for *PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling*☆37Dec 13, 2025Updated 4 months ago
- ☆15Mar 27, 2023Updated 3 years ago
- Code and data for the paper: AI Sees Your Location—But With A Bias Toward The Wealthy World☆18Dec 15, 2025Updated 4 months ago
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset☆246Aug 15, 2025Updated 8 months ago
- ☆52Jan 6, 2026Updated 3 months ago
- The dataset CoLan-150K and the concept decomposition in the paper Concept Lancet (CVPR 2025)☆20Jan 18, 2026Updated 2 months ago
- [NeurIPS 2024] COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing☆25Dec 8, 2024Updated last year
- PyTorch implementation of DeepLab v2 (ResNet) + COCO-Stuff 10k/164k☆15Nov 7, 2018Updated 7 years ago
- An efficient distillation method for flow matching models☆25Feb 1, 2026Updated 2 months ago
- [NeurIPS 2025] DP²O-SR: Direct Perceptual Preference Optimization for Real-World Image Super-Resolution☆78Dec 20, 2025Updated 3 months ago
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆102Feb 11, 2025Updated last year
- [NeurIPS 2025 Spotlight] VisualQuality-R1 is the first open-sourced NR-IQA model that can accurately describe and rate image quality.☆168Oct 15, 2025Updated 6 months ago
- SA-LUT: Spatial Adaptive 4D Look-Up Table for Photorealistic Style Transfer☆46Nov 10, 2025Updated 5 months ago
- [CVPR 2025] Official code of "PanDA: Towards Panoramic Depth Anything with Unlabeled Panoramas and Mobius Spatial Augmentation"☆46Mar 18, 2025Updated last year
- The official implementation of CVPR 2023 paper: Inverting the Imaging Process by Learning an Implicit Camera Model☆32Oct 19, 2023Updated 2 years ago
- Extract LoRA from the original fine-tuned model.☆20May 5, 2025Updated 11 months ago
- Towards Pixel-Level VLM Perception via Simple Points Prediction☆102Feb 9, 2026Updated 2 months ago
- Unlocking the Essence of Beauty: Advanced Aesthetic Reasoning with Relative-Absolute Policy Optimization☆24Jan 27, 2026Updated 2 months ago
- An unofficial implementation of the paper "Neural Preset for Color Style Transfer" (CVPR 2023), including ~300 LUT files☆50Oct 11, 2025Updated 6 months ago
- MILO perceptual quality metric☆23Dec 8, 2025Updated 4 months ago
- [NIPS 25'] Evaluation code for the paper "KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models"☆42Oct 19, 2025Updated 5 months ago