clh124 / VQAThinkerLinks
[AAAI 2026] Official Code for VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning
☆17Updated last month
Alternatives and similar repositories for VQAThinker
Users that are interested in VQAThinker are comparing it to the libraries listed below
Sorting:
- Repo for "Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content"☆38Updated 7 months ago
- Official code for our CVPR 2025 paper: "Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality Assumption"☆63Updated 4 months ago
- ④[ECCV 2024 Oral, Comparison among Multiple Images!] A study on open-ended multi-image quality comparison: a dataset, a model and a bench…☆86Updated last year
- ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models☆89Updated 4 months ago
- MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills [SIGGRAPH 2025]☆72Updated 3 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆86Updated last year
- [TBench 2024] Official implementation of "AIGCBench: Comprehensive Evaluation of Image-to-Video Content Generated by AI"☆48Updated last year
- [ICCV 2025] The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"☆58Updated 9 months ago
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin…☆35Updated 8 months ago
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍☆46Updated 6 months ago
- EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling☆195Updated last month
- [CVPR 2025] AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of Text-to-Video Generation with LMM☆18Updated 5 months ago
- [CVPR 2025 满分论文 Ratings: 555]☆36Updated 8 months ago
- [ACMMM2025] Official released code for VQA² series models☆60Updated 2 months ago
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)☆37Updated last year
- ☆19Updated last year
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆80Updated last year
- Training Autoregressive Image Generation models via Reinforcement Learning☆48Updated last month
- Official model implementation and benchmark evaluation repository of <AnyEdit: Unified High-Quality Image Edit with Any Idea>☆30Updated 6 months ago
- [ICLR 24] MaGIC: Multi-modality Guided Image Completion☆52Updated last year
- Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization☆58Updated 3 months ago
- [NeurIPS 2025 Spotlight] VisualQuality-R1 is the first open-sourced NR-IQA model can accurately describe and rate the image quality.☆148Updated 3 months ago
- [ECCV2024] Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation☆67Updated 9 months ago
- OmniStyle: Filtering High Quality Style Transfer Data at Scale (CVPR 2025)☆34Updated 5 months ago
- Unified Multi-modal IAA Baseline and Benchmark☆91Updated last year
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions☆44Updated 7 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]☆84Updated last year
- [MM 2024 Oral] Refiner for AIGC☆29Updated last year
- [CVPRW2024, Official Code] for paper "Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribu…☆14Updated last year
- Perceptual Artifacts Localization for Image Synthesis Tasks (ICCV 23')☆66Updated 2 years ago