CVI-SZU / FaceBench
[CVPR 2025] FaceBench: A Multi-View Multi-Level Facial Attribute VQA Dataset for Benchmarking Face Perception MLLMs
☆39 Updated 2 months ago
Alternatives and similar repositories for FaceBench
Users who are interested in FaceBench are comparing it to the repositories listed below.
- [ACM MM 2023] QA-CLIMS: Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation ☆13 Updated last year
- (ICCV 2025) This repository is the official implementation of AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detect… ☆135 Updated 3 months ago
- Official repo for 【FaceScore: Benchmarking and Enhancing Face Quality in Human Generation】 ☆80 Updated 10 months ago
- The official implementation of 'FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant' ☆44 Updated 11 months ago
- The official implementation of the ECCV 2024 paper "Facial Affective Behavior Analysis with Instruction Tuning" ☆27 Updated 10 months ago
- [ICCV25 Highlight] The official implementation of the paper "LEGION: Learning to Ground and Explain for Synthetic Image Detection" ☆67 Updated 2 weeks ago
- ☆25 Updated last month
- Invariant Feature Regularization for Fair Face Recognition (ICCV'23) ☆15 Updated 2 years ago
- [WACV'25 Oral] Enhancing Zero-Shot Facial Expression Recognition by LLM Knowledge Transfer ☆50 Updated 8 months ago
- [CVPR 2025] MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities ☆39 Updated last month
- Official implementation of Faceptor: A Generalist Model for Face Perception. ☆47 Updated last year
- ☆54 Updated 2 weeks ago
- SEED Dataset ☆29 Updated 5 months ago
- UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation ☆109 Updated 2 weeks ago
- [ICCV 2025] Official implementation of LLaVA-KD: A Framework of Distilling Multimodal Large Language Models ☆107 Updated 3 weeks ago
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval. ☆44 Updated last year
- [ECCV 2024] SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation ☆41 Updated 7 months ago
- ☆75 Updated 6 months ago
- Millions-Level Face/Human-Scene Image-Text Datasets ☆23 Updated 5 months ago
- FineMotion: A Dataset and Benchmark with both Spatial and Temporal Annotation for Fine-grained Motion Generation and Editing ☆25 Updated 2 months ago
- Implements VAR+CLIP for text-to-image (T2I) generation ☆146 Updated 9 months ago
- [ICML 2025 Spotlight] MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding ☆59 Updated 4 months ago
- [NIPS 2025 DB Oral] Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing ☆113 Updated 3 weeks ago
- [ICLR 2025] Diffusion Feedback Helps CLIP See Better ☆292 Updated 9 months ago
- The official repository of "So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection" ☆18 Updated last week
- ☆26 Updated last week
- [ICLR 2025] TRACE: Temporal Grounding Video LLM via Causal Event Modeling ☆134 Updated 2 months ago
- 🔥 CVPR 2025 Multimodal Large Language Models Paper List ☆156 Updated 7 months ago
- [ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions ☆239 Updated last year
- [NeurIPS 2025 🔥] FakeVLM: Advancing Synthetic Image Detection through Explainable Multimodal Models and Fine-Grained Artifact Analysis ☆81 Updated last month