Kartik-3004 / facexbench
FaceXBench: Evaluating Multimodal LLMs on Face Understanding
☆13Updated last month
Alternatives and similar repositories for facexbench:
Users that are interested in facexbench are comparing it to the libraries listed below
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆16Updated 5 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Updated 7 months ago
- ☆33Updated last year
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆30Updated this week
- 🤖 [ICLR'25] Multimodal Video Understanding Framework (MVU)☆30Updated last month
- Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model☆49Updated 2 months ago
- Official Pytorch Implementation of Self-emerging Token Labeling☆32Updated last year
- The official repo of continuous speculative decoding☆25Updated this week
- Official implementation for "Diffusion Instruction Tuning"☆19Updated last month
- ☆49Updated 3 months ago
- Official Repository of Personalized Visual Instruct Tuning☆28Updated 3 weeks ago
- [ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation☆66Updated 10 months ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆66Updated 5 months ago
- ☆33Updated last month
- ☆42Updated last year
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆47Updated 5 months ago
- ☆17Updated 5 months ago
- Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation, ECCV 2024☆20Updated last year
- Official PyTorch implementation of "Generalized Consistency Trajectory Models for Image Manipulation"☆37Updated 11 months ago
- Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment☆45Updated 2 months ago
- ☆17Updated 4 months ago
- ☆33Updated last year
- ☆41Updated last year
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆57Updated last month
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"☆62Updated 10 months ago
- Official code release for the paper Trapped in texture bias? A large scale comparison of deep instance segmentation, accepted at ECCV 202…☆15Updated last year
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings"☆122Updated 7 months ago
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing☆67Updated 10 months ago
- ☆40Updated 8 months ago
- ☆27Updated 2 months ago