MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills [SIGGRAPH 2025]
☆85Jan 21, 2026Updated 3 months ago
Alternatives and similar repositories for monetGPT
Users that are interested in monetGPT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2025 满分论文 Ratings: 555]☆38May 9, 2025Updated 11 months ago
- Structured Noise Generation from the paper NeuralRemaster with Phase-Preserving Diffusion☆36Feb 1, 2026Updated 3 months ago
- WACV2025☆33Mar 3, 2025Updated last year
- [AAAI2025] Official implementation of the paper "RAP-SR: RestorAtion Prior Enhancement in Diffusion Models for Realistic Image Super-Reso…☆18Mar 22, 2025Updated last year
- ☆43Sep 1, 2025Updated 8 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A fully automated, intelligent photo-editing agent that autonomously plans multi-step aesthetic enhancements, smartly chooses diverse edi…☆44Mar 12, 2026Updated last month
- [SIGGRAPH Asia 2025] "ASIA: Adaptive 3D Segmentation using Few Image Annotations ".☆26Feb 14, 2026Updated 2 months ago
- ☆30Jul 8, 2024Updated last year
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Jun 15, 2024Updated last year
- InstructPix2Pix with distilled diffusion models☆24Jun 30, 2024Updated last year
- Official implementation of "HumanAesExpert: Advancing a Multi-Modality Foundation Model for Human Image Aesthetic Assessment"☆112Apr 15, 2025Updated last year
- [ICLR 2026] DiMeR: Disentangled Mesh Reconstruction Model with Normal-only Geometry Training☆52May 26, 2025Updated 11 months ago
- Q-Insight is open-sourced at https://github.com/bytedance/Q-Insight. This repository will not receive further updates.☆142May 30, 2025Updated 11 months ago
- [Unofficial Implementation] Subject-driven Video Generation via Disentangled Identity and Motion☆58Jan 5, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official implementation of Faceptor: A Generalist Model for Face Perception.☆53Aug 12, 2024Updated last year
- [NeurIPS'25] ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and R…☆38Sep 27, 2025Updated 7 months ago
- Code for "SePPO: Semi-Policy Preference Optimization for Diffusion Alignment."☆18Oct 7, 2024Updated last year
- Official Implementation for *PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling*☆39Dec 13, 2025Updated 4 months ago
- ☆15Mar 27, 2023Updated 3 years ago
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset☆245Aug 15, 2025Updated 8 months ago
- ☆53Jan 6, 2026Updated 4 months ago
- The dataset CoLan-150K and the concept decomposition in the paper Concept Lancet (CVPR 2025)☆20Jan 18, 2026Updated 3 months ago
- [NeurIPS 2024] COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing☆26Dec 8, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [NeurIPS 2025] DP²O-SR: Direct Perceptual Preference Optimization for Real-World Image Super-Resolution☆79Dec 20, 2025Updated 4 months ago
- An efficient distillation method for flow matching models☆26Feb 1, 2026Updated 3 months ago
- [NeurIPS 2025 Spotlight] VisualQuality-R1 is the first open-sourced NR-IQA model can accurately describe and rate the image quality.☆175Oct 15, 2025Updated 6 months ago
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆103Feb 11, 2025Updated last year
- SA-LUT: Spatial Adaptive 4D Look-Up Table for Photorealistic Style Transfer☆47Nov 10, 2025Updated 5 months ago
- Extract LoRA from the original Fine-Tuned model. 从微调模型中提取lora。☆20May 5, 2025Updated last year
- [MM 2023] Toward High Quality Facial Representation Learning☆19Oct 30, 2023Updated 2 years ago
- [CVPR 2025] Official code of "PanDA: Towards Panoramic Depth Anything with Unlabeled Panoramas and Mobius Spatial Augmentation"☆50Mar 18, 2025Updated last year
- [ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs.☆14Aug 8, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Unlocking the Essence of Beauty: Advanced Aesthetic Reasoning with Relative-Absolute Policy Optimization☆24Jan 27, 2026Updated 3 months ago
- an unofficial implementation of the paper "Neural Preset for Color Style Transfer" (CVPR 2023), including ~300 LUT files☆51Oct 11, 2025Updated 6 months ago
- Towards Pixel-Level VLM Perception via Simple Points Prediction☆104Feb 9, 2026Updated 2 months ago
- MILO perceptual quality metric☆25Updated this week
- [NIPS 25'] Evaluation code of paper "KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models"☆45Oct 19, 2025Updated 6 months ago
- ☆132Jun 24, 2025Updated 10 months ago
- ☆16Jun 14, 2024Updated last year