CVI-SZU / FaceBench
[CVPR 2025] FaceBench: A Multi-View Multi-Level Facial Attribute VQA Dataset for Benchmarking Face Perception MLLMs
☆19Updated 3 weeks ago
Alternatives and similar repositories for FaceBench
Users that are interested in FaceBench are comparing it to the libraries listed below
Sorting:
- FineMotion: A Dataset and Benchmark with both Spatial and Temporal Annotation for Fine-grained Motion Generation and Editing☆13Updated last month
- FineMotion: A Dataset and Benchmark with both Spatial and Temporal Annotation for Fine-grained Motion Generation and Editing☆12Updated 2 months ago
- [CVPR 2025] MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities☆20Updated last month
- [ACM MM 2023] QA-CLIMS: Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation☆12Updated 10 months ago
- [CVPR 2025] MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities☆13Updated last month
- Official repo for 【FaceScore: Benchmarking and Enhancing Face Quality in Human Generation】☆69Updated 4 months ago
- [ECCV2024]The official implementation of the DiffPNG paper in PyTorch.☆12Updated 6 months ago
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆73Updated last week
- My implement of InstantBooth☆12Updated last year
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆83Updated last month
- ☆13Updated last month
- [NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos☆118Updated 4 months ago
- Spatial-R1: The first MLLM trained using GRPO for spatial reasoning in videos☆33Updated this week
- Frequency Autoregressive Image Generation with Continuous Tokens☆63Updated 2 months ago
- Unifying Visual Understanding and Generation with Dual Visual Vocabularies 🌈☆43Updated 3 weeks ago
- [CVPR 2025 (Oral)] Open implementation of "RandAR"☆134Updated last month
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"☆55Updated 6 months ago
- Official implementation for "Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter"☆40Updated last year
- Accepted by CVPR 2024☆33Updated 11 months ago
- ☆101Updated this week
- [ICLR 2025] TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning☆33Updated last month
- FQGAN: Factorized Visual Tokenization and Generation☆50Updated last month
- code for the paper "CoReS: Orchestrating the Dance of Reasoning and Segmentation"☆15Updated 2 months ago
- A collection of vision foundation models unifying understanding and generation.☆55Updated 4 months ago
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.☆41Updated 6 months ago
- ☆21Updated 3 months ago
- ☆18Updated 9 months ago
- [TVCG 2024] ReactFace: Online Multiple Appropriate Facial Reaction Generation in Dyadic Interactions☆16Updated 2 months ago
- Implements VAR+CLIP for text-to-image (T2I) generation☆136Updated 3 months ago
- Official repo for "Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge" ICLR2025☆49Updated 2 months ago