UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture
☆94Feb 5, 2026Updated last month
Alternatives and similar repositories for UniPercept
Users who are interested in UniPercept are comparing it to the libraries listed below
- Official code for our CVPR 2025 paper: "Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality Assumption"☆66Sep 15, 2025Updated 5 months ago
- [CVPR 2026] ArtiMuse: Fine-Grained Image Aesthetics Assessment with Joint Scoring and Expert-Level Understanding (Intern · ArtiMuse multimodal aesthetics understanding large model)☆135Feb 25, 2026Updated last week
- Project page of "GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors"☆23Jul 1, 2024Updated last year
- 3D-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric Priors☆12Jun 19, 2025Updated 8 months ago
- Instance-level Facial Attributes Editing (CVIU 2021)☆15Jul 19, 2022Updated 3 years ago
- LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation☆38Mar 3, 2025Updated last year
- ☆18Jan 19, 2026Updated last month
- This is the official implementation of "DreamBoothDPO: Improving Personalized Generation using Direct Preference Optimization"☆27May 28, 2025Updated 9 months ago
- [NeurIPS '25] InstructRestore: Region-Customized Image Restoration with Human Instructions☆49Oct 23, 2025Updated 4 months ago
- [CVPR 2026] Fine-Grained GRPO for Precise Preference Alignment in Flow Models☆50Feb 21, 2026Updated last week
- ☆49Feb 9, 2026Updated 3 weeks ago
- ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback☆121Sep 20, 2025Updated 5 months ago
- Official code for VINCIE: Unlocking In-context Image Editing from Video☆48Sep 8, 2025Updated 5 months ago
- SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing☆19Dec 28, 2024Updated last year
- A minimal PyTorch implementation of Flow Matching for Generative Modeling.☆37Sep 26, 2025Updated 5 months ago
- The official code for paper "GPSToken: Gaussian Parameterized Spatially-adaptive Tokenization for Image Representation and Generation"☆49Sep 28, 2025Updated 5 months ago
- ☆28Apr 15, 2024Updated last year
- Toward Generalizing Visual Brain Decoding to Unseen Subjects☆28May 14, 2025Updated 9 months ago
- Accelerating Multi-Reference Virtual Try-On via Cacheable Diffusion Models☆56Jan 3, 2026Updated 2 months ago
- PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation☆53Jan 5, 2026Updated 2 months ago
- Official implementation of Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents (NeurIPS 2025)☆45Nov 24, 2025Updated 3 months ago
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality☆35Nov 25, 2024Updated last year
- Code for Ray Conditioning☆30Feb 9, 2024Updated 2 years ago
- Official PyTorch codes for "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation", ECCV2024☆30Jul 19, 2024Updated last year
- [ Official ] - Toward Interactive Modulation for Photo-Realistic Image Restoration. CVPRW 2021 NTIRE.☆23Feb 23, 2022Updated 4 years ago
- ☆57Feb 2, 2026Updated last month
- ECCV 2024: A Comparative Study of Image Restoration Networks for General Backbone Network Design☆135Jan 7, 2025Updated last year
- ☆37Jun 20, 2024Updated last year
- [Preprint] UCGM: Unified Continuous Generative Models☆182May 27, 2025Updated 9 months ago
- ☆37Mar 21, 2025Updated 11 months ago
- [AAAI2026] Bring Your Dreams to Life: Continual Text-to-Video Customization☆36Dec 9, 2025Updated 2 months ago
- [CVPR 2026] A training-free, mask-free framework for 3D shape editing.☆25Dec 12, 2025Updated 2 months ago
- ☆10Apr 16, 2020Updated 5 years ago
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024☆31Jul 18, 2024Updated last year
- ☆31Jul 30, 2016Updated 9 years ago
- PyTorch implementation of Self-Refining Video Sampling☆146Feb 6, 2026Updated last month
- [CVPR 2026] Official Code for "ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning"☆84Feb 13, 2026Updated 3 weeks ago
- MILO perceptual quality metric☆22Dec 8, 2025Updated 2 months ago
- PyTorch tips and tutorials☆11Apr 17, 2023Updated 2 years ago