[IEEE T-BIOM] FaceXBench: Evaluating Multimodal LLMs on Face Understanding
☆20 · Jan 15, 2026 · Updated 4 months ago
Alternatives and similar repositories for facexbench
Users that are interested in facexbench are comparing it to the libraries listed below.
- Separable Diffusion Model Unlearning · ☆13 · Jan 29, 2025 · Updated last year
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model · ☆42 · Aug 4, 2024 · Updated last year
- [Doc] Productive Deep Learner · ☆14 · Feb 18, 2025 · Updated last year
- Spiideo SoccerNet SynLoc - Single Frame World Coordinate Athlete Detection and Localization with Synthetic Data · ☆22 · Mar 27, 2026 · Updated 2 months ago
- 3D Traffic Light & Sign Dataset · ☆26 · Mar 24, 2025 · Updated last year
- [CVPRW 2025] Official repository of paper titled "Towards Evaluating the Robustness of Visual State Space Models" · ☆26 · Jun 8, 2025 · Updated 11 months ago
- Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion · ☆12 · Jan 14, 2026 · Updated 4 months ago
- Code for the paper Boosting Accuracy and Robustness of Student Models via Adaptive Adversarial Distillation (CVPR 2023). · ☆34 · May 26, 2023 · Updated 3 years ago
- [CVPR 2026] Official Implementation of "Interact2Ar: Full-Body Human-Human Interaction Generation via Autoregressive Diffusion Models". · ☆18 · Feb 23, 2026 · Updated 3 months ago
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large… · ☆15 · Jun 4, 2025 · Updated 11 months ago
- LLaVA-NeXT-Image-Llama3-Lora, Modified from https://github.com/arielnlee/LLaVA-1.6-ft · ☆46 · Jul 17, 2024 · Updated last year
- Unofficial mirror · ☆12 · Jul 13, 2017 · Updated 8 years ago
- ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition" · ☆14 · Nov 29, 2024 · Updated last year
- ☆12 · Mar 28, 2023 · Updated 3 years ago
- [ICLR 2026] The official repo of "MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs" · ☆38 · Mar 11, 2026 · Updated 2 months ago
- [⭐ CVPR 2025 Highlight ⭐] Official Implementation of the paper STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing fro… · ☆31 · Apr 22, 2025 · Updated last year
- A forked version of flux-fast that makes flux-fast even faster with cache-dit, 3.3x speedup on NVIDIA L20. · ☆24 · Jul 18, 2025 · Updated 10 months ago
- [ICLR 2026] Draw-In-Mind: Rebalancing Designer-Painter Roles in Unified Multimodal Models Benefits Image Editing · ☆28 · May 11, 2026 · Updated 2 weeks ago
- [FCCM 2023] PASTA: Programming and Automation Support for Scalable Task-Parallel HLS Programs on Modern Multi-Die FPGAs · ☆14 · Jun 26, 2025 · Updated 11 months ago
- Implementation of the paper: VG4D: Vision-Language Model Goes 4D Video Recognition (ICRA 2024) · ☆15 · Apr 23, 2024 · Updated 2 years ago
- Implementation of 'FADet: A Multi-sensor 3D Object Detection Network based on Local Featured Attention' · ☆11 · Mar 27, 2024 · Updated 2 years ago
- Code for paper "Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs for Embodied AI" · ☆13 · Jan 19, 2024 · Updated 2 years ago
- Source code for "A deep dive into reinforcement learning" · ☆13 · Dec 17, 2019 · Updated 6 years ago
- Official Pytorch Implementation of Self-emerging Token Labeling · ☆35 · Mar 27, 2024 · Updated 2 years ago
- [CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding · ☆44 · Mar 15, 2024 · Updated 2 years ago
- A simple example of using OpenCV + dlib on Android · ☆10 · Nov 23, 2018 · Updated 7 years ago
- [EMNLP 2023] ALCUNA: Large Language Models Meet New Knowledge · ☆30 · Oct 30, 2023 · Updated 2 years ago
- [CVPR 2026] Official repo for "EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation" · ☆59 · Mar 13, 2026 · Updated 2 months ago
- [NLPCC'23] ZeroGen: Zero-shot Multimodal Controllable Text Generation with Multiple Oracles PyTorch Implementation · ☆14 · Oct 7, 2023 · Updated 2 years ago
- ☆17 · Feb 21, 2025 · Updated last year
- ☆14 · Dec 21, 2023 · Updated 2 years ago
- ☆13 · Dec 12, 2024 · Updated last year
- Object recognition in satellite images (Dior Dataset) using RetinaNet and YoloV5 · ☆20 · Jan 23, 2021 · Updated 5 years ago
- CVPR2025-Multi-party Collaborative Attention Control for Image Customization · ☆17 · May 14, 2025 · Updated last year
- [ECAI 2023] MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation Coefficient · ☆32 · Dec 8, 2023 · Updated 2 years ago
- API module for interacting with and receiving events from August smart locks · ☆13 · Nov 1, 2023 · Updated 2 years ago
- ☆37 · Jan 9, 2025 · Updated last year
- Probabilistic Jacobian-based Saliency Maps Attacks · ☆19 · Nov 28, 2020 · Updated 5 years ago
- Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding · ☆291 · Aug 5, 2025 · Updated 9 months ago