zijianchen98 / GAIA
[NeurIPS2024 D&B Spotlight] GAIA: Rethinking Action Quality Assessment for AI-Generated Videos
☆26 Updated 2 weeks ago
Alternatives and similar repositories for GAIA:
Users who are interested in GAIA are comparing it to the libraries listed below.
- Official released code for VQA² series models ☆39 Updated 3 weeks ago
- Collections of papers and code for employing MLLM for quality assessment tasks. ☆13 Updated last year
- (CVPR 2023) Official implementation of the paper "Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos… ☆29 Updated last year
- ☆21 Updated last year
- Code implementation of the paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval" (AAAI 2025) ☆18 Updated 2 months ago
- [ICME 2023 Oral, Extended to TIP (UR)] The best zero-shot VQA approach that even outperforms several fully-supervised methods. ☆37 Updated last year
- Official code for "Q-Ground: Image Quality Grounding with Large Multi-modality Models", ACM MM 2024 (Oral) ☆40 Updated 5 months ago
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training" ☆34 Updated last year
- ☆14 Updated 3 weeks ago
- [IEEE TCSVT'24] Study of Subjective and Objective Naturalness Assessment of AI-Generated Images ☆34 Updated 2 months ago
- The official repository for the paper "PruneVid: Visual Token Pruning for Efficient Video Large Language Models". ☆35 Updated 2 months ago
- Official code for the CVPR 2024 paper "Discriminative Probing and Tuning for Text-to-Image Generation" ☆31 Updated 3 weeks ago
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval. ☆41 Updated 6 months ago
- MomentDiff: Generative Video Moment Retrieval from Random to Real (NeurIPS 2023) ☆78 Updated last year
- [ICCV 2023] Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning ☆41 Updated last year
- [CVPR 2025] RAP: Retrieval-Augmented Personalization ☆43 Updated 3 weeks ago
- ④[ECCV 2024 Oral, Comparison among Multiple Images!] A study on open-ended multi-image quality comparison: a dataset, a model and a bench… ☆78 Updated 6 months ago
- AGIQA-1k-Database for AI Generated Content Image Quality Assessment ☆27 Updated last year
- AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of Text-to-Video Generation with LMM ☆12 Updated 2 months ago
- Implementation of the paper "Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval" (CVPR 2024) ☆18 Updated 5 months ago
- [ECCV 2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models ☆16 Updated 9 months ago
- [ACMMM Oral, 2023] "Towards Explainable In-the-wild Video Quality Assessment: A Database and a Language-Prompted Approach" ☆77 Updated 8 months ago
- (NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment ☆28 Updated 6 months ago
- [TPAMI] Multi-modality Multi-attribute Contrastive Pre-training for Image Aesthetics Computing ☆17 Updated 5 months ago
- PyTorch implementation of the ECCV'22 paper "Video Activity Localisation with Uncertainties in Temporal Boundary" ☆17 Updated 2 years ago
- Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning ☆20 Updated last year
- [ICLR 2025] TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning ☆31 Updated last week
- ☆29 Updated last year
- [IJCAI 2023] Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment ☆51 Updated last year
- [CVPRW 2024, Official Code] for the paper "Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribu… ☆12 Updated 10 months ago