②[CVPR 2024] Low-level visual instruction tuning, with a 200K dataset and a model zoo for fine-tuned checkpoints.
☆235Aug 12, 2024Updated last year
Alternatives and similar repositories for Q-Instruct
Users that are interested in Q-Instruct are comparing it to the libraries listed below
Sorting:
- ①[ICLR2024 Spotlight] (GPT-4V/Gemini-Pro/Qwen-VL-Plus+16 OS MLLMs) A benchmark for multi-modality LLMs (MLLMs) on low-level vision and vi…☆282Aug 12, 2024Updated last year
- ④[ECCV 2024 Oral, Comparison among Multiple Images!] A study on open-ended multi-image quality comparison: a dataset, a model and a bench…☆86Sep 29, 2024Updated last year
- [ICME 2023 Oral, Extended to TIP (UR)] The best zero-shot VQA approach that even outperforms several fully-supervised methods.☆40Jul 11, 2023Updated 2 years ago
- ③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.☆575Mar 12, 2025Updated 11 months ago
- Official codes for "Q-Ground: Image Quality Grounding with Large Multi-modality Models", ACM MM2024 (Oral)☆44Oct 25, 2024Updated last year
- [ACMMM 2024] AesExpert: Towards Multi-modality Foundation Model for Image Aesthetics Perception☆101Jan 19, 2025Updated last year
- [ACMMM Oral, 2023] "Towards Explainable In-the-wild Video Quality Assessment: A Database and a Language-Prompted Approach"☆85Aug 12, 2024Updated last year
- [WIP@Oct 13] 质衡-基准测试 (Q-Bench in Chinese),包含中文版【底层视觉问答】和【底层视觉描述】数据集,以及中文提示下的图片质量评价。 We will release Q-Bench in more languages in the futu…☆24Jan 7, 2024Updated 2 years ago
- An expert benchmark aiming to comprehensively evaluate the aesthetic perception capacities of MLLMs.☆256Feb 4, 2025Updated last year
- [ECCV 2024] Official Pytorch Implementation of A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment☆92Jul 20, 2024Updated last year
- [CVPR2023] Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective☆235Jan 10, 2025Updated last year
- [Neurips 24 Spotlight] Training in Pairs + Inference on Single Image with Anchors☆48Feb 20, 2025Updated last year
- DepictQA: Depicted Image Quality Assessment with Vision Language Models☆198Nov 28, 2025Updated 3 months ago
- [TPAMI] Multi-modality Multi-attribute Contrastive Pre-training for Image Aesthetics Computing☆25Jul 3, 2025Updated 7 months ago
- [ICCV 2023, Official Code] for paper "Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspect…☆487Aug 12, 2024Updated last year
- [IEEE TCSVT2023] A Fine-grained Subjective Perception & Alignment Database for AI Generated Image Quality Assessment☆69Oct 24, 2023Updated 2 years ago
- AGIQA-1k-Database for AI Generated Content Image Quality Assessment☆29May 1, 2023Updated 2 years ago
- [TCSVT'24] Offical Implementation of 2AFC-LMMs☆12Aug 17, 2024Updated last year
- [IEEE TCSVT'24] Study of Subjective and Objective Naturalness Assessment of AI-Generated Images☆37Feb 9, 2026Updated 2 weeks ago
- [ECCV2022, TPAMI2023] FAST-VQA, and its extended version FasterVQA.☆337Aug 12, 2024Updated last year
- [ICLR'26] "Grounding-IQA: Grounding Multimodal Language Model for Image Quality Assessment"☆58Jan 27, 2026Updated last month
- Collections of papers and code for employing MLLM for quality assessment tasks.☆13Apr 18, 2024Updated last year
- LMM for VQA, tcsvt version☆11Jul 19, 2024Updated last year
- [ACMMM2025] Official released code for VQA² series models☆61Oct 19, 2025Updated 4 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆81Apr 10, 2024Updated last year
- [CVPR 2025 满分论文 Ratings: 555]☆37May 9, 2025Updated 9 months ago
- Q-Insight is open-sourced at https://github.com/bytedance/Q-Insight. This repository will not receive further updates.☆142May 30, 2025Updated 9 months ago
- Official code for "Defense Against Adversarial Attacks on No-Reference Image Quality Models with Gradient Norm Regularization"☆16Aug 7, 2024Updated last year
- [MM 2024 Oral] Refiner for AIGC☆29Jul 29, 2024Updated last year
- [official] Unified Quality Assessment of In-the-Wild Videos with Mixed Datasets Training (IJCV 2021)☆92Sep 16, 2022Updated 3 years ago
- [Neurips 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.☆113Jul 27, 2024Updated last year
- Patch-VQ: ‘Patching Up’ the Video Quality Problem☆76Jan 29, 2025Updated last year
- [AAAI 2023] Exploring CLIP for Assessing the Look and Feel of Images☆477Oct 27, 2023Updated 2 years ago
- ☆27Sep 28, 2024Updated last year
- PyTorch code for our paper "Dog-IQA: Standard-guided Zero-shot MLLM for Mix-grain Image Quality Assessment"☆28Oct 7, 2024Updated last year
- [TCSVT 2025] Official code release of our paper "Towards Explainable Image Aesthetics Assessment With Attribute-Oriented Critiques Genera…☆22Oct 26, 2025Updated 4 months ago
- [ICME2024, Official Code] for paper "Bringing Textual Prompt to AI-Generated Image Quality Assessment"☆21Jul 9, 2024Updated last year
- [ACMMM 2025] Benchmarking MLLM Codec Ability☆33Jun 14, 2024Updated last year
- Official implementation for "Seagull: No-reference Image Quality Assessment for Regions of Interest via Visual-Language Instruction Tunin…☆58Mar 7, 2025Updated 11 months ago