②[CVPR 2024] Low-level visual instruction tuning, with a 200K dataset and a model zoo for fine-tuned checkpoints.
☆235 · Aug 12, 2024 · Updated last year
Alternatives and similar repositories for Q-Instruct
Users who are interested in Q-Instruct compare it to the libraries listed below.
- ①[ICLR2024 Spotlight] (GPT-4V/Gemini-Pro/Qwen-VL-Plus+16 OS MLLMs) A benchmark for multi-modality LLMs (MLLMs) on low-level vision and vi… ☆282 · Aug 12, 2024 · Updated last year
- ④[ECCV 2024 Oral, Comparison among Multiple Images!] A study on open-ended multi-image quality comparison: a dataset, a model and a bench… ☆86 · Sep 29, 2024 · Updated last year
- [ICME 2023 Oral, Extended to TIP (UR)] The best zero-shot VQA approach that even outperforms several fully-supervised methods. ☆41 · Jul 11, 2023 · Updated 2 years ago
- ③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets. ☆580 · Mar 12, 2025 · Updated last year
- [ACMMM Oral, 2023] "Towards Explainable In-the-wild Video Quality Assessment: A Database and a Language-Prompted Approach" ☆86 · Aug 12, 2024 · Updated last year
- Official codes for "Q-Ground: Image Quality Grounding with Large Multi-modality Models", ACM MM2024 (Oral) ☆44 · Oct 25, 2024 · Updated last year
- [ACMMM 2024] AesExpert: Towards Multi-modality Foundation Model for Image Aesthetics Perception ☆102 · Jan 19, 2025 · Updated last year
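- [WIP@Oct 13] 质衡 benchmark (Q-Bench in Chinese), including Chinese versions of the [low-level visual question answering] and [low-level visual description] datasets, as well as image quality assessment under Chinese prompts. We will release Q-Bench in more languages in the futu… ☆24 · Jan 7, 2024 · Updated 2 years ago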
- [ECCV 2024] Official PyTorch Implementation of A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment ☆92 · Jul 20, 2024 · Updated last year
- [CVPR2023] Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective ☆236 · Jan 10, 2025 · Updated last year
- [ICCV 2023, Official Code] for paper "Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspect… ☆492 · Aug 12, 2024 · Updated last year
- [NeurIPS 24 Spotlight] Training in Pairs + Inference on Single Image with Anchors ☆48 · Feb 20, 2025 · Updated last year
- An expert benchmark aiming to comprehensively evaluate the aesthetic perception capacities of MLLMs. ☆256 · Feb 4, 2025 · Updated last year
- DepictQA: Depicted Image Quality Assessment with Vision Language Models ☆203 · Nov 28, 2025 · Updated 3 months ago
- [TPAMI] Multi-modality Multi-attribute Contrastive Pre-training for Image Aesthetics Computing ☆25 · Jul 3, 2025 · Updated 8 months ago
- [ECCV2022, TPAMI2023] FAST-VQA, and its extended version FasterVQA. ☆338 · Aug 12, 2024 · Updated last year
- LMM for VQA, tcsvt version ☆10 · Jul 19, 2024 · Updated last year
- [TCSVT'24] Official Implementation of 2AFC-LMMs ☆12 · Aug 17, 2024 · Updated last year
- [ICLR'26] "Grounding-IQA: Grounding Multimodal Language Model for Image Quality Assessment" ☆60 · Jan 27, 2026 · Updated last month
- [IEEE TCSVT2023] A Fine-grained Subjective Perception & Alignment Database for AI Generated Image Quality Assessment ☆69 · Oct 24, 2023 · Updated 2 years ago
- [ACMMM2025] Official released code for VQA² series models ☆61 · Oct 19, 2025 · Updated 5 months ago
- [IEEE TCSVT'24] Study of Subjective and Objective Naturalness Assessment of AI-Generated Images ☆37 · Feb 9, 2026 · Updated last month
- [MM 2024 Oral] Refiner for AIGC ☆29 · Jul 29, 2024 · Updated last year
- AGIQA-1k-Database for AI Generated Content Image Quality Assessment ☆29 · May 1, 2023 · Updated 2 years ago
- Collections of papers and code for employing MLLM for quality assessment tasks. ☆13 · Apr 18, 2024 · Updated last year
- [official] Unified Quality Assessment of In-the-Wild Videos with Mixed Datasets Training (IJCV 2021) ☆92 · Sep 16, 2022 · Updated 3 years ago
- [NeurIPS 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench. ☆115 · Jul 27, 2024 · Updated last year
- [CVPR 2025 perfect-score paper, Ratings: 555] ☆37 · May 9, 2025 · Updated 10 months ago
- Official code for "Defense Against Adversarial Attacks on No-Reference Image Quality Models with Gradient Norm Regularization" ☆17 · Aug 7, 2024 · Updated last year
- [TCSVT 2025] Official code release of our paper "Towards Explainable Image Aesthetics Assessment With Attribute-Oriented Critiques Genera… ☆22 · Oct 26, 2025 · Updated 4 months ago
- [AAAI 2023] Exploring CLIP for Assessing the Look and Feel of Images ☆481 · Oct 27, 2023 · Updated 2 years ago
- [ICME2024, Official Code] for paper "Bringing Textual Prompt to AI-Generated Image Quality Assessment" ☆21 · Jul 9, 2024 · Updated last year
- [ACMMM 2025] Benchmarking MLLM Codec Ability ☆33 · Jun 14, 2024 · Updated last year
- Q-Insight is open-sourced at https://github.com/bytedance/Q-Insight. This repository will not receive further updates. ☆142 · May 30, 2025 · Updated 9 months ago
- Enhancing Blind Video Quality Assessment with Rich Quality-aware Features ☆62 · Jul 4, 2024 · Updated last year
- T2VScore: Towards A Better Metric for Text-to-Video Generation ☆81 · Apr 10, 2024 · Updated last year
- 👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRIS… ☆3,158 · Updated this week
- [ICLR 2025] What do we expect from LMMs as AIGI evaluators and how do they perform? ☆139 · Feb 3, 2025 · Updated last year
- [CVPR'20] Official SPAQ & Implementation ☆194 · Jan 17, 2024 · Updated 2 years ago