CVI-SZU / FaceBenchLinks
[CVPR 2025] FaceBench: A Multi-View Multi-Level Facial Attribute VQA Dataset for Benchmarking Face Perception MLLMs
☆30Updated 2 months ago
Alternatives and similar repositories for FaceBench
Users that are interested in FaceBench are comparing it to the libraries listed below
Sorting:
- FineMotion: A Dataset and Benchmark with both Spatial and Temporal Annotation for Fine-grained Motion Generation and Editing☆17Updated 2 months ago
- [CVPR 2025] MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities☆13Updated this week
- FineMotion: A Dataset and Benchmark with both Spatial and Temporal Annotation for Fine-grained Motion Generation and Editing☆15Updated 3 months ago
- Official repo for 【FaceScore: Benchmarking and Enhancing Face Quality in Human Generation】☆70Updated 6 months ago
- [ACM MM 2023] QA-CLIMS: Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation☆13Updated last year
- [CVPR 2025] MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities☆26Updated 2 months ago
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆77Updated last month
- The offical implementation of 'FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant'☆44Updated 7 months ago
- Official implementation of Faceptor: A Generalist Model for Face Perception.☆46Updated 10 months ago
- Official code for paper: Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language☆24Updated 4 months ago
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.☆41Updated 8 months ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆120Updated 2 weeks ago
- The official repository for ACL2025 paper "PruneVid: Visual Token Pruning for Efficient Video Large Language Models".☆46Updated last month
- Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing☆62Updated 2 weeks ago
- ICCV2023-Diffusion-Papers☆108Updated last year
- ☆21Updated 5 months ago
- Replication in Visual Diffusion Models: A Survey and Outlook☆29Updated 10 months ago
- Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning☆49Updated last month
- ☆80Updated 7 months ago
- ☆14Updated last month
- [ECCV 2024] SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation,☆30Updated 3 months ago
- Unified layout planning and image generation☆21Updated 2 months ago
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Updated last year
- [ICLR'25] Reconstructive Visual Instruction Tuning☆92Updated 2 months ago
- [ICCV-2023] The official code of Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation☆128Updated this week
- [NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment☆56Updated 9 months ago
- MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer☆44Updated 9 months ago
- MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆40Updated 2 months ago
- Official implementation for "Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter"☆40Updated last year
- [CVPR 2025] T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆86Updated 3 weeks ago