CVI-SZU / FaceBenchLinks
[CVPR 2025] FaceBench: A Multi-View Multi-Level Facial Attribute VQA Dataset for Benchmarking Face Perception MLLMs
☆32Updated 3 months ago
Alternatives and similar repositories for FaceBench
Users that are interested in FaceBench are comparing it to the libraries listed below
Sorting:
- [ACM MM 2023] QA-CLIMS: Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation☆13Updated last year
- [CVPR 2025] MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities☆20Updated last week
- FineMotion: A Dataset and Benchmark with both Spatial and Temporal Annotation for Fine-grained Motion Generation and Editing☆19Updated 3 months ago
- (ICCV 2025)This repository is the official implementation of AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detect…☆81Updated this week
- Official code for paper: Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language☆26Updated 4 months ago
- Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing☆71Updated this week
- VMBench: A Benchmark for Perception-Aligned Video Motion Generation☆53Updated 3 months ago
- Official repo for 【FaceScore: Benchmarking and Enhancing Face Quality in Human Generation】☆73Updated 6 months ago
- 🔥CVPR 2025 Multimodal Large Language Models Paper List☆147Updated 4 months ago
- [CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection☆94Updated last month
- The offical implementation of 'FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant'☆43Updated 7 months ago
- [ICCV 2025] Official implementation of LLaVA-KD: A Framework of Distilling Multimodal Large Language Models☆90Updated 2 weeks ago
- [ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions☆225Updated last year
- Unified layout planning and image generation, ICCV2025☆27Updated 3 months ago
- [CVPRW 2025] UniToken is an auto-regressive generation model that combines discrete and continuous representations to process visual inpu…☆86Updated 2 months ago
- TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation☆101Updated last month
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆79Updated 2 months ago
- The official implementation of ECCV2024 paper "Facial Affective Behavior Analysis with Instruction Tuning"☆26Updated 6 months ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆129Updated last month
- 【CVPR 2025 Oral】Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"☆164Updated 3 months ago
- The official implementation of the paper "LEGION: Learning to Ground and Explain for Synthetic Image Detection"☆42Updated last month
- ☆89Updated 3 months ago
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆355Updated last week
- 🌋👵🏻 Yo'LLaVA: Your Personalized Language and Vision Assistant☆110Updated 3 months ago
- [NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos☆123Updated 6 months ago
- This is the official implementation of 2024 CVPR paper "EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models".☆84Updated 6 months ago
- [ICLR'25] Reconstructive Visual Instruction Tuning☆98Updated 3 months ago
- Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency☆44Updated last month
- Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning☆51Updated 2 months ago
- ☆21Updated 6 months ago